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Multi-parameter singular Radon transforms I: the L^ theory 

Brian Street* 



Abstract 

The purpose of this paper is to study the L^ boundedness of operators of the form 

O: r 

(N . f^^{x)f{'ytix))Kit)dt, 

'H ; 

t^. where ■yt{x) is a C°° function defined on a neighborhood of the origin in (t, x) G R^ x R", satisfying 

■^^ ' 7o(a;) = I, ^/j is a C°° cutoff function supported on a small neighborhood of G R", and _ft' is a 

"multi-parameter singular kernel" supported on a small neighborhood of £ R^. The goal is, given 
an appropriate class of kernels K, to give conditions on 7 such that every operator of the above form 
is bounded on L^. The case when K is a Calderon-Zygmund kernel was studied by Christ, Nagel, 
Stein, and Wainger; we generalize their conditions to the case when K has a "multi-parameter" 
■^r ' structure. For example, when K is given by a "product kernel." Even when K is a Calderon- 

r ) [ Zygmund kernel, our methods yield some new results. This is the first paper in a three part series, 

. ■ the later two of which are joint with E. M. Stein. The second paper deals with the related question 

1 -p^ ' of L^ boundedness, while the third paper deals with the special case when 7 is real analytic. 
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1 Introduction 

The purpose of this paper is to prove the L^ boundedness of multi-parameter singular Radon transforms. 
These operators generalize the well-studied (single-parameter) singular Radon transforms. The study 
of the LP boundedness of these single-parameter singular Radon transforms cumulated in the work of 
Christ, Nagel, Stein, and Wainger [CNS W99] . They studied operators of the form, 

Tf{x)^yj{x)Jf{jt{x))K{t)dt, (1.1) 

where ip is a, C^ cut-off function, supported near G R", jt (x) — 7 {t,x) is a C°° map defined in a 
neighborhood of the origin in R^ x M" satisfying 70 (x) = x, and X is a standard Calderon-Zygmund 
kernel on M^ supported for t near 0. This means that K satisfies 

\d?K{t)\<\tr''-^"^, t^o, 

along with a certain "cancellation condition" which we will make precise later (see Definition 116. 4p |^ In 
[CNSW99] , appropriate "curvature conditions" on 7 were assumed to guarantee T extends to a bounded 
map LP ^- LP {1 < p < 00). See Section [5] for a further discussion of these curvature conditions. The 
goal of this paper is to replace K in (jl.ip with an appropriate multi-parameter kernel and develop 
conditions on 7 which will allow us to show T is bounded on L^. 

For instance, one might consider the case when K is a so-called "product kernel" supported near 0. 
To define this notion, suppose we have decomposed M.^ = M^^ x ■ • • x R^'^ , and write t = (f 1 , . . . , t^ ) e 
M^i X ■ • • X R^'" . A product kernel satisfies 

again along with certain "cancellation conditions" (see Definition 116.41) rl We will develop conditions on 
7 so that the operator T given by (|l.ip extends to a bounded operator on L^ for every such product 



^Actually, |CNSW99| restricts attention to homogeneous kernels K (and replaces the integral in HI. Il l with Jui^^, where 
a > is some small number). This restriction to homogeneous kernels is not essential to their work. 

■^The simplest example of a product kernel is K {ti , . . . , t,^) = Ki {ti) (^ ■ ■ ■ (^ Ki, (t^), where Ki, . . . , Ki, are Calderon- 
Zygmund kernels. 



kernel (relative to a fixed decomposition of M ). In addition, we will study kernels more general than 
these product kernels. When restricted to the single-parameter case, our results will imply the L^ results 
of [CNS W99) -and . in fact, generalize the L^ results of [CNS W99] . even in the single-parameter case (see 
Section [3] for a discussion of this) . 

This paper is the first in a three part series, the second two of which are joint with Elias Stein. 
The second paper, [SSllaj will investigate the corresponding L^ theory {1 < p < oo). The third paper, 
[SSllb| . will discuss, in detail, the special case when 7 is a real analytic function-a situation where more 
can be said. We give an overview of the results of |SSllb) in Section [51 to give the reader a view of this 
entire project. 

The outline of this paper is as follows. In Section [2] we discuss the sequel to this work |SSllb) . 
In Section [3] we discuss some special cases which motivate our main definitions and theorem. Sections 
|4][6] give all the definitions and notation necessary to state our main theorem. Section [7] includes the 
statement of the main theorem. Section |9] gives an overview of some of the main aspects of |CNSW99] we 
will use, along with some minor extensions that will be necessary for our proofs. Section [TOl gives a high- 
level overview of the proof. Sections [TTIfT51 are devoted to stating the technical results and definitions 
necessary for our proof. In Section [14] a general L^ theorem is stated and proved, which will imply our 
main theorem. In Section [TSj the general theorem from Section [TJj is used to prove the main theorem of 
the paper. In Section [THJ we offer a deeper study of the class of kernels we use. Finally, in Section [T71 a 
number of examples are given where the main theorem applies. 
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1.1 Informal statement of the main result 

In this section, we informally state the special case of our main theorem when K {ti, . . . ,t^) is a product 
kernel relative to the decomposition M.^ = R^^ x • • • x M.^" (see the introduction and Definition 116.41 
for this notion). We suppose we are given a C°° function, 7 (t, x) = 74 (x) € M" defined on a small 
neighborhood of the origin in {t,x) G M^ x R", satisfying 70 (x) = x. For t sufficiently small, jt is a 
diffeomorhpism onto its image. Thus, it makes sense to write 7(~ , the inverse mapping. We define the 
vector field 

7,t0 7-i(x)e7;M". 

For a collection of vector fields V, let V (V) denote the involutive distribution generated by V. I.e., 
the smallest C°° module containing V and such that ii X,Y € V (V) then [X,Y] G V{V). For a 
multi-index a G N^, write a = (ai, . . . , a^), with a^ E N^''. 

Decompose W into a Taylor series in the t variable, 

W{t,x)r^^t"Xc,. 

a 

We call a — (ai, . . . , a^) £ N^ — N^^ x • • • x N^" a pure power if a^ ^ for precisely one /i. Otherwise, 
we call it a non-pure power. 

We assume that the following conditions hold "uniformly" for 6 — {Si, . . . ,6,^) G (0, 1]" , though we 
defer making this notion of uniform precise to Section [5] 

• For every S G (0, 1]", 

Vs := V f |4" ■••^i"'''-''^ai,...,Q„ : iai,...,a^) is a pure powerlj 



W{t,x] 

de 



is finitely generated as a C°° module, uniformly in 6. 

• For every 6 G (0, 1]", 

WiSiti,...,S^U) €Vs, 

uniformly in S. 

Remark 1.1. If it were not for the "uniform" aspect of the above assumptions, they would be independent 
of 6. Thus it is the uniform part, which we have not made precise, that is the heart of the above 
assumptions. 

Our main theorem is. 

Theorem 1.2. Under the above assumptions (which are made precise in Section\S^, the operator given 
by 

!^^{x) j f{-ft{x))K{t) dt, 

is bounded on L^ , for every product kernel K (ti, . . . ,t,y), with sufficiently small support, provided tp has 
sufficiently small support. 

The precise statement of our main result (where more general kernels are considered) can be found 
in Theorems O and O 

2 Overview of the series 

One way to view this paper is as the foundation for the theory developed in the sequels where (in joint 
works with E. M. Stein) the corresponding L^ theory is developed |SSlla| . and the special case when 7 
is real analytic is investigated [SSllbj . In this section, we discuss the main results of }SSllb| . where 7 
is assumed to be real analytic. For the purposes of this overview, we restrict our attention to the case 
when i^ is a "product kernel," though more general cases will be covered in the later sections and in 
[SSllal ISSllbj . All of the results discussed here will be contained in jSSllb] . 

The setting is as follows. We decompose M.^ into v factors, R^ = M^^ x • • • x R^", and we write 
t = (ti, . . . ,ti,) G M^. We consider product kernels, K {ti, . . . , t^), relative to this decomposition (see 
the introduction and Definition 116.41 for the notion of a product kernel). In particular, when i> ^ 1, K 
is standard Calderon-Zygmund kernel. We suppose that we are given a germ of a real analytic function 
defined on a neighborhood of the origin in R^ x R"|£| 

7(i,x)=7t(a:):R^xR^'->R", 

satisfying 70 (x) = x. 

There are two, closely related, operators which are of interest to us: 

• Tf (x) — ip (x) J f {jti,...,t^ (x)) K {ti, . . . ,tv) dti • ■ • dt^, where K product kernel supported near 
t — Q and f/' is a C°° cut-off function supported near a; = 0. 

• Mf {x) = '0(a;)supo<5j^ ,5_^<<i j^j|<j |/(7<5iti,...,i5^t„ {x))\ dti ■ ■ -dt^, where V' is a C°° cut-off func- 
tion supported near x — 0, and the supremum is taken over 5i, . . . ,5u small. 

Note that the above operators only make sense if K and '0 are supported sufficiently close to 0, since 7 
is only defined on a neighborhood of 0. Our goal is two-fold: 

(I) Give conditions on 7 such that T is bounded U' {1 < p < 00) for every such product kernel. 

(II) Give conditions on 7 such that M. is bounded on V {\ < p < 00). 



^Here, we have used the notation / : Mq — >■ R™ to denote that / is a germ of a function defined on a neighborhood of 
gR". 



This paper is only concerned with Goal |T] in the case p — 2, though Goal HIl is an essential part of 
ISSllallSSTTb) . 

In thispaper and |SSlla] , we consider the more general case when 7 is C°° . When 7 is C°° , we put 



USD 



two typeqj of assumptions on 7 to ensure that T and M are bounded on L^: 

• A "finite type" condition. This condition can be seen as a generalization of the condition in 
[CNS W99] ■ where the case i/ = 1 (i.e., when _ftr is a Calderon-Zygmund kernel) is treated. 

• An "algebraic" condition. This condition holds automatically in the case i^ = 1 (i.e., when if is a 
Calderon-Zygmund kernel). 

For a discussion of these two conditions in an easy to understand special case, we refer the reader to 
Section [3l When we restrict attention to the case that 7 is real analytic in [SSI lb) , we will use a 
Weierstrass-type preparation theorem (from |Gal79) ) to help show: 

• The "finite-type" condition holds automatically when 7 is real analytic. 

• The "algebraic" condition is not necessary for the L^ boundedness of A^ when 7 is real analytic 
(though it is necessary, in some cases, for the L^ boundedness of T). 

Because of this, our results take a much simpler form when 7 is assumed to be real analytic. We state 
them here. 

Theorem 2.1 ( |SSllbj ). When 7 is real analytic, M is bounded on L^ (1 < p < cx)^ under no additional 
assumptions. 



Remark 2.2. There are at least two special cases of Theorem 12.11 in the literature. The first is due to 
Bourgain |Bou89| in the case v = 1. He studied the case 7 (i,x) : Rj x Rq — ?► M^ given by 7 (i,x) = 
X — tv (x), where z; : Rq ^' R^ is a real analytic vector field. Another special case of Theorem 12.41 is due 
to Christ |Chr92) , where left invariant maximal functions (of a certain form) on a nilpotent Lie group 
are studied. 

We now turn to T. We begin by stating the corollary of our general result in the special case v = 1, 

Corollary 2.3 f [SSllb] ). When v ~ \ (i.e., when K is a Calderon-Zygmund kernel) and when 7 is 
real analytic, T : L^ ^ L^ (1 < p < cx)J under no additional assumptions. This result is a special case 
of Theorem \2.4\ below. 



Unfortunately, when we move to the case 1/ > 1, it is necessary to put additional hypotheses on 7 
to ensure that T is bounded on L^ (or even L^), even when 7 is real analytic. Examples demonstrating 
this can be found in Section [17.51 

To explain our assumptions on 7, we must first introduce some notation. In |CNSW99] it was shown 
that every 7 could be written in the form, 

7ti,...,t„ (2;) ~ exp ^ t"^ ■■■t"''Xai....,aA X, 

\|qi|+--- + |q„|>0 / 

where the Xai,...,a^ are vector fields and we write 74 (x) ^ exp (^^ f^Xa) x if 74 (x) = exp I X]u|<l ^"^a 

o(|t|^). 

To each A'aj_...,Q_^ we assign the formal degree da^^,_^a^ = (lo^il : • • ■ 7 \oi-A)- We say that da is a pure 
power if da is nonzero in precisely one component, otherwise we say it is a non-pure power. Define two 
sets 

V = {{Xa^da) : do, is a pure power} , 

TV = {(Xa^da) : do, is a non-pure power} , 

Let S be the smallest set of vector fields with formal degrees such that. 



*We do not make the dichotomy between these two types of conditions expUcit in this paper: it is only particularly 
relevant when 7 is real analytic, and we defer a precise discussion of these issues to ISSllb| . 



• VCS, 

• If (Xi, di) , {X2, d2) e 5, then ([Xi, X2] , di + da) G 5. 

Theorem 2.4 ( [SSllb] ). Let 7 6e reaZ analytic. Suppose that, for every (Y, e) G A/", i/iere is a neigh- 
borhood U — U (Y, e) of and vector fields (Xi, di) , . . . , {Xq, dq) G S such that for every 5 G [0, 1] , we 

hav^ 

<i 

1 \^ - 2_^C,jO ^j|y, 

where {rf : (5 G [0, 1]"} C C°° (C/) is a hounded set. Then T is bounded on L^ (1 < p < ooj for every 
product kernel K (with sufficiently small support), provided ip has sufficiently small support. 

Remark 2.5. Actually, the result in [SSllbj requires less than is outlined in Theorem 12.41 the set U can 
depend on 5 in a specific way, and one requires less than {c|} forming a bounded set. However, while 
this stronger result is important for applications, it requires a good deal of notation to state precisely. 
We, therefore, defer the discussion to [SSllbj . 

Remark 2.6. Note that if F = for every (Y, e) G N, then T is bounded on L^, as the conditions of 
Theorem 12.41 hold trivially. In particular, in the case v = 1, M — % and CoroUarv 12.31 follows. 

Remark 2.7. One of the main aspects of jSSllb] is showing that Theorem 12.41 (and its more general 
analog alluded to in Remark 1 2. 5 p are a special case of Theorem 17.11 below (at least when p = 2; more 
general p are studied in [SSllaj V 

3 A Special Case 

Before we enter into the rather lengthy statement of our main result, we discuss a special case. Namely, 
the case when, 

7( (x) = e^o<i°i<'>-f *°-^°a;, (3.1) 

where each X^ is a C°° vector field-note that this is an exponential of a finite sum of vector fields. 
Later in the paper, we will deal with more general 7; see Remark l3.6l for some comments on how we will 
do this. The goal in this section is to discuss conditions we may place on the Xa so that the operator 
given by (|l.ip is bounded on L^ . The conditions will depend on the type of kernel K which is used in 

(ED- 

We begin by discussing the work in [CNSW99] in this special case. Then, we discuss a way in 
which our results generalize the results in |CNSW99] in the single-parameter setting. We then close by 
informally discussing a special case of the multi-parameter setting. 

In [CNSW99J . operators of the form, 

f^i^{x)jf{-ft{x))K{t)dt, (3.2) 

were studiedjj where 1/' is an appropriate cut-off functions, and K \s a, Calderon-Zygmund kernel with 
small support (how small depends on 7). One of the main aspects of [UNSW99] was exhibiting conditions 
on 7 for which the operator given by (|3.2p is bounded on L^ (1 < p < cx)). In the special case when 7 
is given by p.ip . this condition is that {Xa} satisfy Hormander's condition. I.e., that the Lie algebra 
generated by the vector fields {^q} span the tangent space at every point of the support of if. 

Now suppose we are no longer in the situation where {-'^q} satisfy Hormander's condition. Instead, 
let V be the involutive distribution generated by the Xa. I.e., let V be the C°° module generated by 
the Xa and the commutators of the Xa of all orders. Suppose that V is finitely generatecO as a C°° 



5 We write S -^i = U'^^.^i V ■ 

^ |CNSW99) did not restrict attention to 7 of the form l|3.1| l: nor will we. However, it is easier to understand the basic 
ideas in this special case. For the connection between ICNSW99] and ours when 7 is more general, see Section 117.31 where 
a there is a proof of how the framework of |CNSW99] fits into our more general setting. 

'^Usually, one sees the condition that X> is locally finitely generated. However, our results are local in nature, and so it 
suffices to assume that X> is finitely generated. 



niodule|3 Note, if Vx is the vector space given by evaluating all of the vector fields in V at the point 
X, then we are not assuming dini'Dx is constant in x. The classical Frobenius theorem applies to show 
that IR" is locally foliated into leaves, where the space of smooth sections of the tangent bundle of each 
leaf is given by V. Since the dimension of dimP^; is not necessarily constant, this could be a singular 
foliation; i.e., the dimension of the leaf may vary from point to point. It is clear that, when restricted 
to each leaf, {X^} satisfy Hormander's condition. 

One might hope, given the above discussion, that one could apply the theory of |CNSW99] to 
each leaf, and then put this all together to obtain the L^ (or, in our case, L^) boundedness of p.2p . 
Unfortunately, one runs into a technical difficulty: near a singulai|j point of the involutive distribution 
2?, the classical proofs of the Frobenius theorem do not yield uniform enough control of the coordinate 
charts defining the leaves to be able to apply the theory of |CNSW99) uniformly on each leaf. 

Fortunately, one can obtain uniform control of these coordinate charts. Indeed, this was one of the 
main aspects of [Strllj . In fact, the theory in [Strllj will be the main technical tool on which we will 
base all of our "scaling" arguments (see Sections [5] and [Til for more on IStrlip P^I Because of this, in 
the above case, we will obtain the L^ boundedness of (|3.2p in this paperFH 

Remark 3.1. In the special case when the X^ are real analytic, the distribution V is automatically 
finitely generated as a C°° module (when restricted to a sufficiently small neighborhood)!^ This gives 
intuition as to why Corollarv l2.3l is true. 

Let us rephrase the above single-parameter theory in a slightly more complicated way, which will 
lead us naturally to the multi-parameter theory. In what follows, we take the standard dilations on 

t = (ii,...,iAr)eM^: 

St^{dti,...,dtN), (5e(o,i]. 

In later sections of the paper, we will raise S to different powers in each coordinate, but we ignore such 
generalizations for the moment. One of the main aspects of Calderon-Zygmund kernels is that the class 
of kernels is dilation invariant: 

is again a Calderon-Zygmund kernel if K is (and this is true uniformly in S). To take advantage of this 
dilation invariance, we wish to restate the above condition that the involutive distribution, V, generated 
by {Xa} is finitely generated as a C°° module, in a naturally dilation invariant way. 
Note that. 

Our dilation invariant version of the above assumption is that the involutive distribution generated by 
{(5l"IXa} is finitely generated "uniformly" for S £ (0, 1] (of course for every S the involutive distribution 
so generated is always equal to D, so it is the uniform aspect that is the point). We make this notion 
of uniform precise in a moment. We will see, in the single-parameter case (which we are presently 
considering), this uniformity follows for free (Lemma l3.2|) . but this is not the case in the multi-parameter 
setting (see Section fl 7. 7|) . 

Assign to each vector field Xa the formal degree \a\. Recursively, assign formal degrees as follows: if 
Xi has formal degree di and X2 has formal degree d2 , then we assign [Xi , X2] the formal degree di + d2 
(it is possible that the same vector field may have more than one formal degree). Scaling Xa by (S'"' 
induces a scaling 6"^^ Xi where Xi has formal degree di . 

Our "uniform" assumption in S is as follows. We assume that there is a finite list of the above vector 
fields Xi, . . . , Xq, Xj having formal degree dj, each {Xa, \a\) appearing as some {Xj, dj), and this list 



°This is a generalization of Hormander's condition. Indeed, Hormander's condition states that the involutive distribu- 
tion generated by the Xa is the entire space of vector fields (on a neighborhood of the support of ip). This distribution is 
clearly finitely generated: it is generated by the coordinate vector fields. 

^A point X is said to be singular, if dimC^; is not constant on any neighborhood of x. 
In particular, the results in |Strll) allow us to use multi-parameter Carnot-Cartheodory balls, without resorting to 
the weakly-comparable hypotheses in Section 4 of ITW03I . by introducing some extra assumptions on the relevant vector 
fields. This is an essential point of our analysis, and might also be useful in problems related to the ones in ITW03| . 
^'^The corresponding L^ (1 < p < 00) boundedness holds as well, and will be covered in the sequel to this paper ISSlla| . 
^^This is a result of Lobry ILobTOI . and can be seen as a consequence of the Weierstrass preparation theorem. See 
| Lob70l ISSllb| for a further discussion. 



satisfying, 

[S^^X^S^'-Xk] = J2 41'^'''^'' ^ e (0, 1] , (3.3) 

1=1 

where c •'^. is a C°° functioio uniformly in S. Note that, by taking S — 1, (|3.3p imphes that X* is a finitely 
generated distribution (it is generated by Xi , . . . , Xq as a C°° module) . We also have the converse, 

Lemma 3.2. One may choose {Xi,di) , . . . ,{Xq,dq) as above, so that (13.31) holds, if and only if V is 
finitely generated as a C°° module. 

Proof. As noted above, the only if part is clear. To prove the converse, we will prove a stronger form of 
p.3p . Indeed, we will show that we may select {Xi,di) , . . . , {Xq,dq) as above so that, 

[X„Xk]^ J2 4,fe^'' (3-4) 

di<dj+dk 

where c' j, are C°° functions. Note that p.4p implies p.3p . as one may take cA = J'^j+^fc^'^'c'. ^, 
when dj + dk > di, and otherwise. Incidentally, p.4p was first introduced in |NSW85| to study 
vector fields which satisfy Hormander's condition. Suppose Xi, . . . ,Xr are generators for 2? as a C°° 
module. We may assume that each Xj arises as an iterated commutator of the Xa. Thus, each Xj, 
j = 1, . . . ,r, has assigned to it a formal degree dj. We may also assume that each Xa appears in the 
list Xi, . . . , Xr. We let (Xi, di) , . . . , {Xq, dq) be an enumeration of all vector fields with formal degree 
dk such that dk < maxi</<r di (here we have taken Xi, . . . , Xr to be the first r elements of Xi, . . . , Xq). 
We claim that this list: {Xi, di) , . . . , {Xq, dq) satisfies (|3.4p . Indeed, if we take the commutator [Xj, Xk] 
(1 l£ j,k < q), there are two possibilities. The first possibility is that dj + dk < niaxi<;<rd/. In this 
case ([Xj, Xfe] , dj + dk) already appears in the list {Xi, di) , . . . , {Xq, dq) and so (|3.4p is trivial. On the 
other hand, if dj + dk > niaxi<;<r di, then we use the fact that, 

r 

[Xj,Xk] = 2l4,fc^'' 
1=1 

as this immediately follows from the fact that Xi, . . . , X^ generate P as a C°° module. Now (|3.4p follows 
immediately. D 

Now that we have stated our single-parameter assumptions in a dilation invariant manner, we are 
prepared to state a multi-parameter result. For the purposes of this introduction, we restrict our 
attention to the two-parameter situation, but we will see later in the paper that all of these ideas extend 
to any number of parameters. 

We will be considering operators of the form 

f^^P{x)J f (7(.,t) {x)) K {s, t) ds dt, (3.5) 

where {s,t) £ R^^ x 'EJ^^ . Here K {s,t) is a product kerneo (supported near s = /; = 0). That is, K 
satisfies, 



afafx(s,i) 



< ur^^-i"! |i|-A'2-i/3i 



along with certain cancellation conditions. See Section [16] for more precise details. The relevant dilations 
in this situation are two-parameter. For {5i,52) G [0, 1] , we define, 

{5i,52){s,t)^{5is,52t). 



^^When we present our assumptions on the vector fields in more detail (see Section [Sjl we will only require l|3.3| l to hold 
locally near each point in a precise sense, and will require less than the functions being C°° . We defer such details to 
Section [6] 

'^^For our main theorem, we will discuss kernels more general than product kernels. See Sections l4l and [T6l 



We consider 7 of the form, 



— pZ]o<|Q| + |/3|<Af «"* ^a 



l{s.t) {x) = e^«<i°i+i'' 



'X, 



where, as before, the Xa.p are C°° vector fields. Dilating gives, 

l(Sis,S2t) [x) = e^«<i°i+i'3i<Af 1 2 =.32._ 

In analogy with the single parameter case, we assign to the vector field Xa_p the two-parameter formal 

degree da^p = {\a\ , |/3|). Thus, scaling Xa^p by S'^'Sif is the same as scahng it by {61,62) "'"j where we 
think of da^p as a multi-index. 

In the two-parameter situation, a new phenomenon appears. We must separate our vector fields into 
two sets and view these two sets differently. To define this, we call da.p a pure power if either |a| or |/3| 
is 0. Otherwise we call it a non-pure power. We define two sets, 

P ~ {{Xa,p,da,i3) ■ da^p is a pure power} , 
A/" — {{Xa.p,da^p) : da^p is a non-pure power} . 

In the single-parameter case, every power was a pure power, but in the multi-parameter case, we must 
deal with the set A/" in a different manner. 

First we discuss the case when Xa^p — for every (X^.p^da^p) G Af. This case works in much the 
same way as the single-parameter case. We recursively define formal degrees on the iterated commutators 
of the elements of V as follows. If Xi has formal degree di G N^ and X2 has formal degree ^2 G 
N^, we assign to [Xi,X2] the formal degree di + d2- Our assumption is that there is a finite list 
{Xi,di) , . . . , {Xq, dq) of these vector fields, containing the set P, such that for every 6 G [0, 1] , we can 
write, 

[5'^^X„(5'''=X,]=^4;t<5'^'Xz, (3.6) 

where c A G C°° uniformlM^^I in 6. Note, by taking 62 — 0, (|3.6p implies that the vector fields {X^.o} 
satisfy the single-parameter assumptions. Similarly, by taking 61 = 0, p.6p implies that the vector 
fields {Xq p} also satisfy the single parameter assumptions. Moreover, (j3.6p implies more: it implies a 
condition on the commutators between the {X^.o} and the {Xq^}. 

More generally, when there is some nonzero vector field in A/", we proceed as above, obtaining the 
list {Xi,di) , . . . ,{Xq,dq) from P and assuming it satisfies p.6p . We further assume that for every 
{Y, e) ^ N and every 6 G [0, 1] , we have, 
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6<'Y =y c^Y 6"^' ^i^ (3-7) 



with Cy G C°° uniformly in 6. See Section 117.51 for an example of how LF' boundedness can fail if we 
do not have p.7p and Remarks 110.31 and 112.151 for how p. 71) is used in the proof. 

Our main theorem is a generalization of the following: under the above hypotheses, the operator 
given by (|3.5p is bounded on Li^ . 

Remark 3.3. It may not be obvious that our main theorem generalizes the above. The fact that it does 
is discussed in Section [17.11 

Remark 3.4. As mentioned before, the obvious analog of Lemma 15^ in the multi-parameter setting does 
not hold. This is discussed in Section [17.71 

Remark 3.5. Our assumptions above on the set P (i.e., the existence of (Xi,di) , . . . , [Xq,dq)) can be 
thought of as the "finite type" assumptions discussed in Section [21 The assumptions on the set J\f can 
be thought of as the "algebraic" assumptions. As mentioned in Section [2 the "finite type" assumptions 
hold automatically when the X^ are real analytic. This is proven in [SSI lb) . 

-"^^ Again, wc have just stated a speeial ease of our assumptions on c .',. for simplicity. 



Remark 3.6. In the rest of the paper, we work with more general functions 74(2^), not necessarily 
assuming that 74 (x) = e^* -'^"x. If one were to assume that 74 (x) = e^^^'x for some C°° vector 
field depending smoothly on t with A{0) = 0, then it would not be difficult the generalize the above 
assumptions. Unfortunately, not every 7 is of this form. Fortunately, there is a convenient alternative. 
Since 70 (x) = x, 74 is a diffeomorphism onto its image, for t sufficiently small. Thus, we may define. 



^(M) = | 



7et o 7j (x) g T^ 



Note that W (t) is a vector field, depending smoothly on t, satisfying W (0) — 0. Moreover, it turns out 
that the map 7 i— >■ VF is a bijection (see Proposition II 2. l"|) . The inverse mapping is given by solving an 
ODE. This ODE is similar to the one which is used to define the exponential map. We will see that 
everywhere we wish to use the exponential map, it can be replaced by the inverse mapping W^ H> 7. 
Thus, in what follows, we will exhibit analogs of the above conditions applied to the vector field W (t). 

4 Kernels 

In this section, we define the kernels K for which we will study operators of the form (jl.ip . The kernels 
we study will be distributions on R^, and they will be supported in B^ (a) = {x G M^ : \x\ < a}, where 
a > is a small number to be chosen later|l£| Fix i^ G N. We will be studying i^-parameter operators. 
Fix a subset A C [0, 1]"^ such that if 61,62 G A, then ^1 V ^2 G A. 

We suppose that we are given z^-parameter dilations on R^. That is, we are given e = (ei, . . . , ejv) 
with each ^ e^- = (e], . . . , ej) G [0, 00)''. For S e [0, 00)" and t = (ii, . . . , In) G R^, we define 

St = {S''Hi,...,6'"tN), (4.1) 

thereby obtaining 1/ parameter dilations on M.^ . 

The class of distributions we will define will depend on N, a, A, and e. At a first reading, the reader 
may wish to take A ~ [0, 1] , as this case contains all the main ideas. 

We will make two passes at the definition of the class of kernels. First, we will define a class of kernels 
which we will denote hy IC — JC {N, e, a. A). This class will be simple to understand, and contains all of 
the main ideas. Our proof, however, will works for a slightly larger class of kernels JC {N, e, a. A). This 
class of kernels is important for applications, and so we state our results in this context. On a first pass, 
the reader may wish to just understand the proof in the case of /C, however. 

Remark 4.1. Actually, the class IC is much simpler to understand in the case when A — [0,1] . See 
Lemma [16.21 for details. 

4.1 The class JC 

For reach /i, 1 < /i < z^, and given i = (ti, . . . ,ijv) G R^, let i^ denote those coordinates tj of t such 
that e^ 7^ 0. 

Given a function <j on M^ , and j G N'"' , define 

Note that <j^^ '' is defined in such a way that 

^(2')(t) dt^ f<;{t) dt. 

Let - log2 A^ {j eW : 2"^ G A}. Note that if ^ := [0, 1]", then - loga A = W. 



^"For one thing, we will always choose a > so small that for \t\ < a, ■yt is a diffeomorphism onto its image; so that 
7(~ makes sense. 
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Definition 4.2. Wc define /C = /C {N, e, a, A) to be the set of all distributions, K, of the form 

K^ E 4"^ (4-2) 

where {<;j} C C^ [B^ (a)) is a bounded set, satisfymg 

<^j (i) di^ = 0. (4.3) 



The convergence in (|4.2p is taken in the sense of distributions. We will see in Lemma [16.11 that every 
such sum converges in distribution. 

Remark 4.3. Given N, e, and A, we will define a class of functions 7t such that the operator given by 
(jl.ip is bounded on L^ for every K € K. {N, e, a, A) for a > sufficiently small. Note that ii A Q B, then 
K. {N, e, a, A) Q K. {N, e, a, B). However, our theorem will apply to a larger class of 7 corresponding to 
A than it will corresponding to B. 

4.2 The class /C 

While the class K. contains all of the main ideas, in many applications the kernels that arise are in a 
slightly larger class, which we denote /C (iV, e,a,.4). Indeed, we will see the benefits of working with 
this class in Section [121 The main point, here, is that in our proofs we do not use the full cancellation 
condition (|4.3p . Thus, the definition of the class /C will be the same as that of AC, but we will weaken 
the condition (|4.3p . 

Before we can define IC, we need some new notation. Fix a constant C > 1, and j — (j'l, . . . , j,y) G 
— log2 A. For 1 < /i < I', we say fx is j, ^-minimal, if there does not exist a fc e — log2 A such that 
Jii > kfi. For 1 < /.ti, /i2 < z^, we write 

Ml dij,c,A M2 

if for every k — {ki,. . . ,k^) e — logj A with j^^ > fc^^, we have j^^ — fc^^ < C {j^^ ~ ^1^2)- We define 

[Mi]i,c,.A = {^2 : A^i d:j,c,A f^2 and /i2 ^j,c,^ Mi} ■ 

Note that /xi e [/^i] p_^. Also, we define t^ '"'^'^ = i^'^)ne\u 1 ' '^here repeated coordinates are only 
included once. I.e., t^ ^"^■^ consists of those coordinates tj such that e^ 7^ for some ^ e [^1] ^^. We 
let ^2 ^'"^"^ be the rest of the coordinates, so that t — (t^ ^''^"^ , ^2 ^''^"^ ) • Finally, we say fii ~<j^c,A M2 
if /ii :<j^c,A fJ-2 but we do not have /i2 ^j,c,^ Mi- 

Definition 4.4. We define K. — IC {N, e, a. A) to be the set of all distributions, K, such that there exists 
a C > 1 such that K can be written in the form 

K= T. .f\ (4.4) 

je-iog^A 

where {<jj} C C^ {B^ (a)) is a bounded set, satisfying the following cancellation conditions: for every 
j G — log2 A, and every /u such that /i is not j, ^-minimal, and there does not exist a /i' with ^ ^j,c,yl m'j 
then we assume 

^, (t) d^'''-^'^ = 0; (4.5) 

otherwise, we allow the integral to be possibly non-zero. 

Remark 4.5. Note that it is evident that /C {N, e, a,A}^IC {N, e, a, A). 

In Section (Tni we further discuss the class /C, though the results there will not be used in the rest of 
the paper. In particular, we discuss following facts: 
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1. Every sum of the form (|4.4|) converges in the sense of distributions. 

2. So e/C(7V,e,a, [0,1]''). 



3. Product kernels (in particular Calderon-Zygmund kernels) and flag kernels (see Definitions 116.41 
and 116.61 and |NRS01| ) are special cases of fC. Indeed, product kernels (with support in B^ (a)) 
are the case when each Cj is nonzero in precisely one component, and A = [0, 1] (this is 
a result of |NRS01j ). Similarly, fiag kernels are the kernels with the same e but with A — 
{S £ [0,1]'^ : Si > S2 > ■ ■ ■ > Sv}- Note that, as in Remark 14. 3| our main theorem will apply to 
a larger class of 7 when considering K a flag kernel, than it will when considering K a product 
kernel. See Section FIT. 61 for more details on this. 

4. There are interesting kernels which are neither product kernels nor fiag kernels which are of the 
form AC {N, e, a, „4)-see the end of Section [16] for an example. 

5 Multi-parameter Carnot-Caratheodory geometry 

At the heart of the conditions we will assume on 7 lies multi-parameter Carnot-Caratheodory geometry. 
Thus, before we can even define the class of 7 we will study, it is necessary to review the relevant 
definitions of multi-parameter Carnot-Caratheodory balls. We defer the theorems we will use to deal 
with these balls to Section [TT] Our main reference for Carnot-Caratheodory geometry is [Strllj . and 
we refer the reader there for a more detailed discussion. 

Let H. C M" be a fixed open set, and suppose Xi, . . . , Xq are C°° vector fields on il. We define the 
Carnot-Caratheodory ball of unit radius, centered at xq G fi, with respect to the list X by 



Bx (xo) -.^ {yen 



37:[0,l]->f7,7(0)=a;o,7(l)=J/, 

y (i) - E «^- (*) ^1 (^ (*)) ' «J- e ^°° ([0' 1]) ' 



E I 

A<]<q 



< 1 



L-([04]) 

Now that we have the definition of balls with unit radius, we may define (multi-parameter) balls of 
any radius merely by scaling the vector fields. To do so, we assign to each vector field, Xj, a (multi- 
parameter) formal degree ^ dj = (d], . . . , dj) € [0, cxd)''. For S = {Si, ... ,6^) S [0, 00)'', we define the 
list of vector fields 6X to be the list (^5'^^Xi, . . . , S'^'iXqj. Here, S'^^ is defined by the standard multi-index 

notation: iJ'^j = Ilu^i '^m^ • ^'^ define the ball of radius 6 centered at xq E fl by 

B{x,d} {xo,S) := Bsx (xq) . 

At times, it will be convenient to assume that the ball B(^x,d) {xo, S) lies "inside" of il. To this end, 
we make the following definition. 

Definition 5.1. Given xq E fl and fi' C fi, we say the list of vector fields X satisfies C {xq, H') if for 
every a = (ai, . . . ,a,) G (L°° ([0, l]))^ with 



Nil 



L~([0,1]) 




<1, 



i°°([0,l]) 



there exists a solution 7 : [0, 1] — > ft' to the ODE 



7'(t) = E«jW^^(^W)' 7(0)=xo. 
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Note, by Gronwall's inequality, when this solution exists, it is unique. Similarly, we say (X, d) satisfies 
C {xo, 6, n') if SX satisfies C (xo, ft'). 

One of the main points of |Strll) was to provide a detailed study of the balls B(^x,d) {xo,S), under 
appropriate conditions on the list {X,d). To do this, we first need to pick a subset A C [0, 1]'', and a 
compact set Xq d il|^ Wc will (essentially) be restricting our attention to those balls B(^x,d) (^0,^) 
such that xq £ Kq and S ^ A (this A is the same set as in Section [4]). 

Definition 5.2. We say {X, d) satisfies V {Kq, A) if the following holds: 

• Take fl' with Kq d fJ' d fJ and ^ > such that for every S £ A and x € Kq, {X,d) satisfies 
C{x,^S,n'). 

• For every 6 £ A and a; G Kq, we assume 

[S'^^X.,, S'^^X,] = Y. '^f'S^'-Xk, on B^xM) (x, m ■ 

k 



For every ordered multi-index a we assume^ 



sup 

x£Ko 



< oo. 



If we wish to be explicit about il' and ^, we write V {Kq, A, fl',S.)- 

It is under condition 'D{Kq,A) that the balls B^x,d) (2;,<5) were studied in [Strllj . We refer the 
reader to Section [TT] for an overview of the theorems from [Strll] that we shall use. 

In what follows, we will not be directly given a list of vector fields with formal degrees satisfying 
'D{Ko,A), 

{Xl,dl) , . . . , (Xq,dq) , 

but, rather, we will be given a list of C°° vector fields with formal degrees which we will assume to 
"generate" such a list. 

To understand this, let {Xi,di) , . . . ,{Xr,dr) be C°° vector fields with associated formal degrees 
^ dj £ [0, oo)". For a list L — (Zi, . . . , /„) where 1 < Ij < r, we define, 

Xl = ad {Xi,)a.d (Xi,) ■ • ■ ad {Xi„^_,) X,,„, 
dL ^ di^ + di.^ -\ h (i;,„ . 

We define S = {{Xl, d^) : L is any such list} . 

Definition 5.3. We say S is finitely generated or that {Xi,di),. . . , {Xr,dr) generates a finite list if 
there exists finite subset, J^ C S, such that T satisfies V {Kq, A]i!3 and 

{Xj,dj)eT, l<j<r. 

If we enumerate the vector fields in J-, 

T={{X,,di),...,{X„d,)}, 

we say that {Xi, di) , . . . , {Xr, dr) generates the finite list {Xi,di) , . . . , {Xq, dq). Note that, if 5 is finitely 
generated, {Xi,di) , . . . {Xr, dr) could generate many different finite lists. However, if we let {X, d) and 
{X' , d') be two different such lists then either choice will work for our purposes. In fact, it is easy to see 
that {X, d) and {X' , d') are equivalent in the sense discussed in Section lTl.il It follows from the discussion 
in Section fll. 11 in every place we use these notions, it will not make a difference which finite list we use. 
Thus, we will unambiguously say "(Xi, di) , . . . , {Xr, dr) generates the finite fist {Xi,di) , . . . , {Xq, dq)," 
to mean that {Xi, di) , . . . , {Xr, dr) generates a finite list and (^i, di) , . . . , {Xq, dq) can be any such list. 

^'^One should think of Kg as the closure of a small open set, on which the operator of study is supported; i.e., the 
Schwartz kernel of the operator we are studying will be supported in Kg X Kg . 

^**We write ||/||(;70((7) := sup^gjj |/ (x)|, and if we say the norm is finite, we mean (in addition) that / is continuous on 

^^Here, we are thinking of Kg and A fixed. 
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Remark 5.4. We can rephrase the above in the following way. We iteratively take comniutators of the 
XjS and assign formal degrees as above. The assumption that (Xi, di) , . . . , {Xr-, dr) generates a finite 
list says that after a finite number of steps, one "does not get anything new." For an example of how a 
list {Xi,di) , . . . , {Xr, dr) might not generate a finite list, see Section [17.61 

6 Surfaces 

In this section, we define the class 7 for which we will study operators of the form (|l.ip . The functions 
we study generalize the ones studied in |CNSW99l : even in the single parameter case, when K is a. 
Calderon-Zygmund kernel, our class of 7 is somewhat wider that those studied in |CNSW99] (cf. the 
special case in Section [3]). 

In |CNSW99] a number of equivalent "curvature conditions" on 7 are defined under which the single- 
parameter version of operators of the form (jl.ip are studied. We discuss some of these conditions in 
Section ini Unfortunately, the most convenient way (for our purposes) to look at these conditions was 
not directly studied in |CNSW99] . In Section [HI we define and discuss a new curvature condition, (Cz), 
which is equivalent to the other curvature conditions in [CNSW99]. In this section, we jump straight to 
the generalization of (Cz) to the multi-parameter situationo 

We assume that we are given an open subset H. C R" and a subset ^ C [0, 1]"^ (such that if 61,62 G A, 
then 6i\/ 62 & A), and i/-parameter dilations e as in Section |4l 

Definition 6.1. Given a multi-index a G N^, we define, 

N 

deg (a) = 2_. ^j^j £ [0, 00)"^ . 

Let i^o (E ri' (E ri" (£ J7 be subsets of 57 with Kq compact and il' and Vt" open by relatively compact 
in J7. Our goal in this section is to define a class of C°° functions 

7(i,x) :B^(p) xn" ^Q. 

such that 7 (0, x) = x. Here p > is a small number. This class of functions will depend on A, N , and e 
(it will also depend on Kq, Q,, 51', and Q,"). This class will be such that if -0 is a C^ function supported 
on the interior of Kq, then there is an a > 0, sufhciently small, such that the operator given by (jl.l|) is 
bounded on L^ for every K G IC {N, e, a. A). 

As in Section [U we will write 7^ (x) = ^(t,x). Note, by possibly shrinking p > we may assume 
that for every t € B^ (p), 74 is a diffeomorphism onto its image. From now on we assume this, so that 
it makes sense to write 7j~ throughout the paper. 

Unlike the work in |CNSW99l . we separate the conditions on 74 into two aspects. For the first, 
suppose we are given a list of C°° vector fields on 51", Xi,. . . ,Xq with associated r/-parameter formal 
degrees di,...,dq satisfying V {Kq, A, 51', £) for some ^ > (we will see later where these vector fields 
come from). 

Definition 6.2. Suppose we are given a C°° vector field on U," , depending smoothly on t G B^ (p), 
W (t,x) G Tj;51". We say that {X,d) controls W {t,x) if there exists < pi < p and < ti < ^ such 
that for every xq G Kq, 6 & A, there exists functions cf°'' on B^ (pi) x B(x.d) (^Oi ''"1'^) satisfying, 

• W {5t, x) = Y^l^^ cf "'"^ (t, x) 6'^'Xi (x) on B^ (pi) x Bi^x.d) (^0, ti6), where 6t is defined as in (|4T|) . 



SUPxoSKo E|a| + |/3|<m i^^) ^tC 



5eA 



\a fiP xo,S 



-I 



< 00, for every m. 

CO(B"(pi)xi3(x,d)(a;o,Ti5)) 



^"strictly speaking, what follows is not a straight generalization of the curvature conditions in ICNSW99] . As mentioned 
in Section [3] our results will hold for a larger class of 7 in the single-parameter case. However, it is reasonable to think of 
what follows as the multi-parameter "generalization" of the curvature conditions in |CNSW99] . 
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Definition 6.3. We say {X,d) controls 74 (x) if {X,d) controls W {t,x), where, 

7ct ° 7*"^ (x) ; 



Wit,.) = ^^ 



here, et = (eii, . . . , ei„) and so has nothing to do with the dilations e. 

Part of our assumption on 7 will be that a particular list of vector fields {X, d) controls 7* . Where 
these vector fields come from constitutes the other part of our assumption. 

Let W be as in Definition [531 Let Xa be the Taylor coefficients of W when the Taylor series is taken 
in the t variable: 

Wit)^J2^"X^, (6.f) 

a 

SO that Xa is a C°° vector field on 51". 

Our assumption on 7 is that if we take the set of vector fields with formal degrees: 

V — {{Xa, deg (a)) : deg (a) is nonzero in only one component} , (6-2) 

then there is a finite subset J^ C V such that T generates a finite list {X, d) — {Xi, di) , . . . , {Xq, dq), 
and this finite list controls 7. 

Remark 6.4. The list of vector fields {X, d) depends on a few choices we have made in the above: it 
depends on the chosen subset J- and it depends on the chosen list generated by J-. However, none of 
these choices affects {X, d) in an essential way. This is discussed in Section If f .21 

Remark 6.5. Note that the above assumptions are local. Indeed, if C is a large compact set, and if 
for each point xq £ C, 7 satisfies the above assumptions with Kq some compact neighborhood of xq, 
then 7 satisfies the above assumptions with Kq replaced by C. Thus, one should think of Kq as a small 
neighborhood of a point. However, we find it more straightforward to fix Kq in what follows, as opposed 
to letting it (possibly) shrink from line to line. We took the opposite perspective in Sections II. 11 and [H 

Remark 6.6. The "curvature conditions" in JCNSW99] are equivalent to saying {Xa : a S N^} satisfies 
Hormander's condition (see (Cz) in Section [S]). We will see in Section ri7.3l that this is a special case of 
the above assumptions, when v ^ 1. 

Remark 6.7. Note that our conditions on 7 only involve the vector field W. It is not hard to see, though, 
that W uniquely determines 7. This is discussed further in Section [T^ 

Remark 6.8. Notice the scale invariance of Definition 16.31 Indeed, for 61,62 G A, the condition imposed 
on W{6it,x) is formally the same as the condition imposed on W{62t,x) provided one replaces 6iX 
with 62X. This scale invariance will play an essential role in our proofs. This also explains the entrance 
of Carnot-Caratheodory geometry in our assumptions: the Carnot-Caratheodory balls scale in the same 
way that our other assumptions scale. 

Remark 6.9. Unlike in Section [2l we have not separated our assumptions on 7 into a "finite type" 
condition and an "algebraic" condition. This distinction only has real significance when 7 is real analytic, 
and we, therefore, differ discussion of it to [SSI fbj . 

7 Statement of results 

Fix fl C M" open, and Kq (^ fl' (^ il" (^ il with Kq compact (with nonempty interior) and ft' and il" 
open but relatively compact in fl. Let 

jit,x) ■.B^{p) xfi" -^n 

be a C°° function such that 7 (0, x) = x. Here, p > is a small number. 

Fix 1/ G N positive, and A C [0,1]'' such that for 61,62 € A, 61 W 62 G A. Furthermore, let 
e = (ei, . . . , ejv) be given, with 7^ ej G [0, 00) . We suppose 7 satisfies the assumptions of Section [5] 
with this Kq, A, and e. 
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Theorem 7.1. For every ^j G C^ (R") supported in the interior of Kq, there exists a > such that for 
every K (z IC (N, e, a, A) the operator 

Tf{x)^i;{x)Jf{^t{x))K{t)dt (7.1) 

extends to a bounded operator L^ (R") — !• i^ (R"). 

Actually, Theorem 17.11 follows directly from the following, slightly more general theorem. 

Theorem 7.2. There exists a > such that for every ipi,^p2 S C'^ (R") supported on the interior of 
Kq, every K ^ JC {N, e, a. A) and every C°° function 

K {t, x) : B^ (a) X r^" ^ C 

the operator 

T (/) {x) = ^1 {x) J f (7* (x)) ^2 (7t (x)) n (i, a:) i^ (t) di (7.2) 

extends to a bounded operator i^ (R") -^ i^ (R"). 

Proof of Theorem \7.1\ given Theorem \7.2\ Given ip take -02 to be equal to 1 on a neighborhood of the 
support of tp. It is easy to see that if K has sufficiently small support, then -02 {jt (x)) = 1 on the 
domain of integration in (17. 2p (with ip in place of -01 )■ Recall that K is supported in B^ (a), and so 
this may be achieved by shrinking a. Taking k = 1 yields Theorem 17. II D 

Remark 7.3. The reason we work with Theorem l7.2l is that the class of operators given by (|7.2[) is closed 
under adjoints, while the class of operators given by (|7.ip is not. This may not be immediately obvious 
from our assumptions. See Section [12.31 for details. 

Remark 7.4. In the second paper in this series (joint with Elias Stein [SSllaj ). we will prove a corre- 
sponding LP theorem (1 < p < oo). The operators for the L^ theorem will not be as general as those 
covered by Theorem 17.21 we will need to assume a special form for the set A. However, A = [0, 1]" is 
of this form, and so in that special case, the results of Theorem 1 7. 2 1 do extend to L^. 

8 Basic Notation 

Throughout the paper, for v — (ui,...,w„) 6 R", we write |ti| for (^ \vj\ j , and we write \v\^ for 

sup„ \vj\. B" (77) will denote the ball of radius 77 > in the |-| norm. For two numbers a, 6 G R we write 
aV5 for the maximum of a and h and aAb for the minimum. If instead, a — (ai , . . . ,an) ,b — (61 , . . . , 6„) G 
R", we write a V 6 (respectively, aAb) for (ai V 61, . . . , a„ V 6„) (respectively, (ai A 61, . . . , a„ A 6„)). 

For a vectors S = {Si, ... , S^) , d = {di, . . . , di/) G R'', we define S'^ by the standard multi-index 
notation. I.e., 6'^ = JJ^^i SJ'^ ■ Also we wiU write 2^* = {2'^\ . . . , 2'^-). 

Given a, possibly arbitrary, set U C R" and a continuous function / defined on a neighborhood of 
U , we write 

WfWcHu)^ E snp\dSf{x)\, 

and if we state that ||/||f7j([/) is finite, we mean that the partial derivatives up to order j of / exist on 
U, are continuous, and the above norm is finite. If / is replaced by a vector field Y = ^^ ak (x) dx^, 
then we write, 

Given two integers 1 < 771 < n, we let I [m, n) denote the set of all lists of integers (ii, . . . , im) such 
that 

1 < ii < i2 < ■ ■ ■ < im < n. 
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Furthermore, suppose A is an n x g matrix, and suppose 1 < no < n Aq. For / G I (no, n), J e I (no, q) 
we let Aj^j denote the no x no matrix given by taking the rows from A which are hsted in / and the 
columns from A which are listen in J. We define 

det A = (det A/,j)/£i(„ „) , 
""^"^ JeX(no,g) 

so that, in particular, det„pxno ^ is a vector (it will not be important to us in which order the coor- 
dinates are arranged). det„(,xno A comes up when one changes variables. Indeed, suppose $ is a C^ 
diffeomorphism from an open subset U C K"" mapping to an no dimensional submanifold of R", where 
this submanifold is given the induced Lebesgue measure dx. Then, we have 



f{x)dx^ / /($(t)) 



det d$(t) 

rioxno 



dt. 



If yl = (Ai, . . . , Aq) is a list of, possibly non-commuting, operators, we will use ordered multi-index 
notation to define A", where a is a list of numbers 1, . . . ,q. \a\ will denote the length of the list. For 
instance, if a = (1, 4, 4, 2, 1), then |a| = 5 and A" = A1A4A4A2A1. Thus, if Ai, . . . Ag are vector fields, 
then A" is an |a| order partial differential operator. 

If / : R" —i' R™ is a map, then we write, 

for the differential of / at the point x applied to the vector field -^. If / is a function of two variables, 
/ {t, x) : M.^ X R" -^ R™, and we wish to view df as a linear transformation acting on the vector space 
spanned by ^ (1 < J < N), then we instead write, 

to denote this linear transformation. Hence, it makes sense to write, 

det —-(t,x), 

riQXno at 

where uq < m A N. 

Finally, we will devote a good deal of notation to multi-parameter Carnot-Caratheodory geometry. 
See Sections [51 and [TT] 

9 The work of Christ, Nagel, Stein, and Wainger 

In this section we review some of the aspects of the fundamental work |CNSW99] , and we also discuss 
some further results that follow easily from that work, which will be useful in the multi-parameter 
setting. The results of this section will be of use to us in two ways. First, the work in ^CNSW99 offered 
the primary inspiration for the results in this paper, and a review of these results will offer the reader 
much better context in which to understand this paper. Second, we will be able to save quite a bit of 
effort by "transferring" some of the single parameter results from [CNSW99] to the multi-parameter 
setting. For an example of this "transferring" in a simpler context, see Section 5.2.4 of jStrllj . 
In |CNSW99j . operators of the form, 

Tf{x)^i;(x) j !{-ft{x))K{t)dt (9.1) 

were studied, where '0 is a cutoff function supported near a fixed point xq G R", K is a Calderon- 
Zygmund kernel supported near t = 0, and 7 : R^ x R" — >■ R" is a C°° function^ H with 70 (x) = x and 
satisfying an additional condition, denoted by (C) in |CNSW99] . 

^-•^ Actually, wc only need 7 defined near G M^ and xq £ R". 
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Moreover, |CNSW99] showed that (C) could be viewed as any one of a number of equivalent condi- 
tions. Three of their conditions are useful for our purposes. We review these three below, along with a 
fourth condition (not directly studied in jCNS W99) ) which is also equivalent to the other three. 



/99j showed that 7^ can be written asymptoticly as 



22 



7t (x) ~ exp I ^ f^Xa ) X, 

i|>0 



(9.2) 



where each X„ is a C°° vector field on R" (see Theorem 8.5 of |CNSW990 . Condition (Cg) then 
states that the vector fields Xa satisfy Hormander's condition. I.e., that the Lie algebra generated 
by the Xa spans the tangent space to R" at xq. 



(Cj): One defines T 



^R" by 

r (r, x) = 7(1 o 7(2 o • • ■ o 7t„ (x) , 



(9.3) 



where t — (t^, . . . , t") G M^". (Cj) then states that for some multi-index /?, we have. 



d\\ dT , 

a~ det -- (t,Xo) 



T = 



^0. 



(Cy)|f!| For 1 < j < iV, define the C°° vector fields. 



W,{t,x) = d^{t,^^\x)) 



_d_ 
9t" 



_d_ 

dsi 



It+s o 7i ^ {x) 



s=0 



We now express Wj as a Taylor series in the t variable. 



W,(i)^^f 



aXaJ 



where Xaj is a C°° vector field. (Cy) then states that the vector fields {Xaj ■ \a\ > 0,1 < j < N} 
satisfy Hormander's condition at xq- 

{Cz): Define the C°° vector fieldEI 



W[t,x) 



d_ 
We 



let O It ^ i^) ■ 



e=l 



Express M^ as a Taylor series in the t variable. 



where Xa is a C°° vector field. (Cz) then states that the vector fields {Xa} satisfy Hormander's 
condition at xq. 



The following result is mostly contained in |CNSW99] : 

Theorem 9.1. (Cg) <^ {Cj) <^ (Cy) <^ (Cz)- We denote by (C) any of the above equivalent conditions. 



22||g2J means that, for every M, ■yt (x) = exp (Z;o<|Ql<Af *"'''^") + O (|t|^M , as t -> 0. 

23See the bottom of page 521 of ICNSW99I to find (Cy). 

^^Once again, et = (eti, . . . , etn) and so does not use any non-standard dilations as in (14.111 . 
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Proof. (Cg ) ^ [Cj) is contained in Theorem 8.8 of |CNSW99| . (Cg) ^ {Cy) is Proposition 9.6 of 
[CNS W99) . We are therefore left with showing only {Cy) <^ (Cz) and the proof will be complete. 

For a set of C°° vector fields V, let Lie (V) denote the Lie algebra generated by V. Let Wj, Xa,j, 
W, and Xa be as in the definitions of (Cy) and (Cz)- We will show, 

Lie {Xaj ■■l<j<N,ae N^} = Lie {X^ : a e N^} , 

which will complete the proof. In fact, we will show for every M e N, 

Lie {X^j : 1 < j < A^, |a| < M} = Lie {X^ : \a\ < M + 1} . (9.4) 

We proceed by induction on AI. It follows directly from the definition of W and Wj that, 

N 

W{t,x) = Y.*,W,{t,x). (9.5) 

In light of (|9.5p . the base case {M = 0) of (|9.4p is trivial. We assume we have (19.41) for M — 1 and prove 
it for M. From (|9.5p . the containment D is trivial, and so we prove only the containment C. To do so, 
we use the fact that 7 satisfies the ODE in each variable, 

J-7t(x)=I^,(t,7t(a:)). (9.6) 



Hence, the Wj must satisfy the integrability condition: 

^[W,,Wk]. (9.7) 



dWj dWk 



7 + — Q 



dtk dtj 

See, for example^ Theorem 10.9.4 of (Die60] . Let fj be the multi-index which is 1 in the jth coordinate 
and in all other coordinates. From, (|9.7p . we obtain, 

^ 7! ' (5! 

As a consequence, if jaj = M — 1, our inductive hypothesis shows, 

X„+/, J - X„+/,.,fc = mod Lie {X, : |a| < M} . (9.8) 

Fix /3 with |/3| = M. We wish to show X^j- G Lie {X^ : |a| < M + 1}. For every k such that /3-/fc e W , 
we have by ^M (since |/3 - /fc| = M - 1), 

^fi+!,-S.M = Xn^, mod Lie {X^ : \a\ < M} . 

Applying (|9.5p . we see that the coefficient of t^^^^ in the Taylor series for W is given by, 

^/3+/.= E TWZT^^^^'^^'^ modLic{X„:|a|<Af}, 

where C is a nonzero constant. Hence, Xj^j S LiejATQ : |q;| < M + 1}, completing the proof. D 

One of the main theorems of f CNSW99] is the following: 

Theorem 9.2 (Theorem 11.1 of [iCNSW99] ). Suppose 7 satisfies (C) at some point xq, and suppose ip 
is a C^ function supported on a sufficiently small neighborhood of xq and K is a standard Calderon- 
Zygmund kernel supported sufficiently close to 0. Then, the operator given by (j9.ll) extends to a bounded 
operator on L^ (1 < p < 00/ 



^^That 119.71 1 must hold follows simply by using the equality dt^dt-ft = dtdti^ft and applying 119. 6| l. 
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Remark 9.3. We will use multi-parameter analogs of the concepts in the above four conditions. For 
example, the assumptions placed on 7 in Section [6] is the relevant multi-parameter analog of (Cz)- We 
will discuss multi-parameter analogs of the other conditions described above, and a multi-parameter 
analog of Theorem 19.11 in Section 112.11 

As is discussed in Section 5.2.4 of [Strll] . results in the single-parameter setting (like Theorem 19. ip 
can often be lifted to the multi-parameter setting, provided the results are uniform, in an appropriate 
sense. Fortunately, in this instance, the desired uniformity follows from Theorem 19. II via a compactness 
argument. We now state and prove the main consequence of Theorem 19.11 which we will use. 

For notational convenience, we take xq = in what follows. We wish to define "uniform" versions of 
(Cg), {Cj), (Cy), and (Cz)- Fixp > 0, r? > 0, and let 5 be a set of C°° functions 7 : B^ (p)xS" (ry) -^ W\ 
satisfying 70 (x) = x. We wish to define the notion of the above conditions holding uniformly for 7 G 5. 
We denote this by (C^), (C}'), {Cy), and (C|) respectively. 



(Cg): For 7 G 5, define Xa as in (Cg). (C") states that there exists M £ N, independent of 
7 G iS, such that < Xa '■ |a| < M > satisfies Hormander's condition at 0, uniformly for 7 G 5. More 
precisely, that there exists M' G N, c > 0, independent of 7 G 5 such that if we let Vi, . . . , Vl 
denote the list of vector fields containing IXa '■ |a| < M>, along with all commutators of the 

vector fields in < X^ '■ |a| < M > up to order A/', then we have, 



det V (0) 



>c, 



where we have written V to denote the matrix whose columns are Vi , 



,Vl. 



{Cj): For 7 G iS, define F as in {Cj). {Cj) then states that there is an Af G N and a c > 0, both 
independent of 7 G 5 such that for every 7 G 5 there exists (3 with \/3\ < M, and 



— det— T,0 

Ot / nxn Or 



T = 



> c. 



(Cy)'- For 7 G iS, define Xa.j as in (Cy)- (Cy) states that there exists M G N, independent of 
7 G 5 such that {Xaj : \a\ < M, 1 < j < N} satisfies Hormander's condition at 0, uniformly for 
7e5(asin(C«)). 

(C^): For 7 G 5, define the vector fields {Xa} as in (Cz)- {Cz) states that there exist Af G N, 
independent of 7 G 5, such {Xa '■ \a\ < M} satisfies Hormander's condition at 0, uniformly for 

7G5(asin(C«)). 



Theorem 9.4. Let S be as above, and suppose S C C° 



S^ (p) X B" (tj) ; M") is a bounded set. Then, 



Proof. We prove only (C^) => {Cj) as that is the most important implication for our purposes. All of 
the other implications follow in a completely analogous manner. Suppose S satisfies (C^) but not {C'j). 
Let 7'' G 5 be a sequence satisfying the following. Define F-' in terms of 7^ as in (|9.3p . We choose 7^ so 
that for every multi-index /3 with |/?| < j, we have, 



^ det^(r,0) 
Or J nxn Ot 



1 
< -. 

j 



This is clearly possible since S does not satisfy {C^). Since bounded subsets of C°° are precompact, 
we may select a convergent subsequence 7^ -^ 7°°. It is easy to verify (since S satisfies (Cf )) that 7°° 
satisfies {Cz). It is also easy to see, by our choice of 7-', that 7°° does not satisfy {Cj). This contradicts 
Theorem 19.11 and completes the proof. D 
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Remark 9.5. The reader might suspect that one could improve Theorem 19.41 bv tracing through the 
proof in [CNS W99) . and determining in precisely which way AI and c (from {Cj)) depend on the C^ 
norms of the elements of S and Mi, M2, and c (from (C^)). In fact, this is the case (at least if one 
replaces (Cj) with (Cj) as defined in Definition 10.9 of |CNSW99] ) . One can use this to improve the 
way in which various constants depend on each other in this paper. However, this improvement would 
not change the main thrust of our results. Moreover, even if one does proceed in this manner, all of our 
results would still be far from optimal in this sense. We, therefore, leave such details to the interested 
reader. 



10 Informal outline of the proof 

Before we begin the rigorous proof of Theorem 17. 2[ in this section we offer an informal outline to help 
the reader see the big picture of what follows. We also highlight the difficulties that arise in the present 
work, that did not arise in previous works, such as |CNSW99] . We take all the same notation as in 
Theorem l7.2l Furthermore, for the purposes of this section, we imagine that the vector fields Xi, . . . , Xq 
span the tangent space at every pointo This will not be used in later sections, but it makes the informal 
description of the argument simpler. Letting K G JC {N,e,a,A) we may decompose K, 



K = 



E 



<;, 



(2^) 



je-iog^A 



Defining, 



T,f (x) = i,, (x) / / (7t (x)) 4,2 {it {x)) K (i, x) d ' (t) dt 



(2O 



T, 



(10.1) 
(10.2) 



we wish to study the operator, 

E ^.- 

je-iog^-A 

The goal is to show that (|10.2I) converges in the strong operator topology, as bounded operators on 
L^. In fact, we will show that the family {Tj} . ^ is an almost orthogonal family, and the result will 
then follow from the Cotlar-Stein lemma. More specifically, we will show, 



ll^fc^illL2_j.^2 



\w:\\l^ 



(10.3) 



for some e > 0, for every j,k G — log2 ^. Theorem 17.21 follows directly from (110. 3p . In this section, we 
focus just on 



ll^fe^illL2^i2 



< 2^'=I.J~'=I 



(10.4) 



the other inequality being similar. In what follows, we let e > be a number that may change from line 
to line, but will always be independent of j, fc G — logj A. 
(|10.4p follows directly from the inequality. 



{T;TknT,) 



n+l 



L^^L^ 



< 2^'l^'^'=l 



where n is the dimension of the a;-space. One has the trivial inequality, 

\n+l 



iTmnr, 



k-'-j) 



L^^L^ 



<1, 



and so interpolation with (jl0.6|) shows that to prove (jl0.5|) we need only prove. 



[T;TkT^Tj) 



n+l 



< 2^^!^^*^ 



We will see, for a fixed point xq, 



{T*nnT,y^'f{xo) 



(10.5) 



(10.6) 



(10.7) 



(10.8 



^These are the vector fields from Section (6] obtained from tfie map 7. 
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depends only on the values of / on a small neighborhood. U = U (j, fc, xo), of xq (later in the paper, we 
will have to be precise about how small a neighborhood) cj Because of this, given j, fc, and xo, we will 
be able to construct a diffeomorphism, 

$ = $,, fc,,„ : B" (ry) ^ f/, 

with $(0) — Xq and rj is independent of j, fc, and xq. The point of this diffeomorphism will be to 
"rescale" (|10.7I) near the point xq so that it will follow from elementary considerations. 
Let $#/ (u) = / (<& (u)). Then, to prove (|10.7p . it suffices to show, 

$# (T;*T,.T,!T,)"+' ($#)-' g(0) I < 2-IJ"-'=l ||.g||^^ , (10.9) 

where g — ^"^ J . Here, the implicit constants in (|10.9I) cannot depend on the point xq. 

Remark 10.1. Later in the paper, we will not explicitly conjugate our operators by the puUback ^"^ . 
However, the way in which we proceed is equivalent to the above. 

The map $ will be defined in such a way that the effect of conjugating Tj by ^"^ leaves one with 
an operator of the same form as Tj (though, with a different 7) but with j replaced by j', where 
i' — j — j A k. With an abuse of notation, we call this new operator Tj>. Similarly, Tk is replaced by 
Tfc/ where k' = k - j A k^ 

In this way, we can reduce (|10.7p to the case when j A fc = (provided one has good enough control 
of the implicit constant in (llO.Tp ). More precisely, if we letr^l 

6»t(u) = $"io72-,Afcto$(u), 

then we are interested in the operator, 

Tfg (u) = g {02-3't (")) « (^. u) ^j (t) dt, 



where k is a C°° function (uniformly in any relevant parameters) and we have suppressed ^i and ■02 
since we are working locally. A similar formula holds for Tk> . 

The diffeomorphism $ will be defined in such a way that if we formally write. 



% (u) - e^o<i"i*°'*'°u, 



(see Section [9]) then the vector fields 

{Yq, : deg (a) is nonzero in precisely one component} (10.10) 

satisfy Hormander's condition uniformly in any relevant parameters. Rewriting pO.lOp using the fact 
that / A A:' = 0, we see, 

{Ya : f ■ deg (a) = or fc' • deg (a) = 0} (10.11) 

satisfy Hormander's condition uniformly in any relevant parameters. Hence, writing, 

' *^ ^ (10.12) 



5-fc 



t (m) - e^°<i°i * ^ ^"^ 



we see the set of vector fields in (|10.12p which are not scaled satisfy Hormander's condition. Because of 
this, it will be possible to apply the methods of CNSW99 to the operator, 

T*Tk'T^,T,,, 

to prove (|10.9p and complete the proof. 



^'If the vector fields Xi, . . . , Xq do not span the tangent space at xq, U would instead be a small neighborhood of xq 
on the leaf passing through xq , generated by Xi , . . . , Xq . 

^*Moreover, even in the case when Xi, . . . ,Xq do not span the tangent space, the analogous vector fields associated to 
Tji and Tj./ do span the tangent space (uniformly in any relevant parameters). 

^®<I> will be defined in such a way that the map 9 defined above is C°°, uniformly in any relevant parameters. 
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Remark 10.2. In short, the above says that the map 02-''' t °^2-i' t satisfies (Cg) uniformly in any relevant 
parameters. When we turn to the rigorous proof, it will be more convenient to use the (equivalent) fact 
that as j, k vary, the set of all such 02-k'f o 6'2-j't2 satisfies (C^)- We will then be able to apply Theorem 



Remark 10.3. Note that we have used, in an essential way, that we are only considering those vector 
fields Ya such that j' ■ deg (a) = oi k' ■ deg {a) = 0. This highlights the difference between the pure 
powers and non-pure powers discussed in Section [3] 

We close this section by discussing some difficulties that arise in the multi-parameter situation, that 
did not arise in the single parameter situation of [CNS W99] . 

In ^CNSW99j the Rothschild-Stein lifting procedure (see |RS76| ) was used to "lift" the problem to a 
higher dimensional setting; in fact the problem was lifted to a high dimensional stratified Lie group. In 
this higher dimensional setting, the scaling map $ had a very simple form: it was given by the standard 
dilations on this stratified Lie group. In the multi-parameter setting, this lifting argument creates more 
difficulties than it alleviates. In particular, it is unclear how to consider the non-pure powers, nor how to 
naturally deal with the multi-parameter dilations. Because of this, we must work directly with a more 
complicated scaling map $. This map was defined and studied in [Strllj . and we review the relevant 
theory in Section [TlJ As a consequence, the lifting procedure is not necessary to prove the results of 
[CNSW99] . The single-parameter version of the results in [Strllj were first obtained in (NSW85| . Using 
the scaling maps from [NSW85J , one can (with some reordering of the proof) recreate the entire theory 
of |CNSW99] without resorting to the lifting procedure. 

In the single-parameter case, one is given two numbers j, fc e N, and one wishes to show, for instance, 

\\T*T-\\ < 9^«IJ^*=I 



When, for instance, k < j, it suffices to show. 






for some large, but fixed, M. In the multi-parameter situation, however, it could be that some coordi- 
nates of j are greater than those of k, while the opposite is true for other coordinates. Because of this, 
we must instead show. 

This turns out to be more of a notational difficulty than anything else. 
10.1 Diffeomorphism invariance 



The assumptions of Theorem l7.2l are invariant under diffeomorphisms. More precisely, if we let T be the 
operator in Theorem 1 7. 21 and if we let ^ : il -> fJ be a diffeomorphism, then the operator (^*) T\E'# 
satisfies the assumptions of Theorem 17.21 This is straightforward to check. 

It turns out that more is true. Denote by f the map {^*) ~ T'i'*, and by fj the map (*#) ~ Tj*#, 
where Tj is as in Section [TOl Note that T = J2je-iog a'^j^ ^^^ ^^ ^^'^^ ^^ '^^ ^^'^ applied the proof in 
Section [TO] to T, this is the decomposition of T we would have used. Fix j, k, and Xq as in Section [TOl 
Let <I> be the scaling map associated to Tj,Tk at xq, and let $ denote the scaling map associated to 
Tj, Tk at '^ {xq). It is not hard to see, from the definition of $ in Section [TT] that. 

$ = V]/ o $. 

Thus, we have, 

$#f, ($#)"' = $#*# {^*)-^T,^* {m*)-^ ($#)-' = ^*T, ($#)"' ; 

with a similar result for Tk and T^. Thus, one obtains the same operators when pulling back Tj and 
Tk via $ as when pulling back Tj and Tk via $. Hence, not only are the assumptions of Theorem 17.21 
invariant under diffeomorphisms, but a large part of the proof actually remains completely unchanged 
when conjugated by a diffeomorphism. 
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11 Multi-parameter Carnot-Caratheodory geometry revisited 

In this section, we present the resuhs that ahow us to deal with Carnot-Caratheodory geometry. The 
resuhs we outhne here are contained in Section 4 of |Strll) . The heart of this theory is the abihty to 
"rescale" vector fields. This rescaling is obtained by pulling back via a particular diffeoniorphism, which 
will be denoted by $ in what follows. 

Before we can enter into details, we must explain the connection between multi-parameter balls and 
single-parameter balls. We assume we are given C°° vector fields Xi^ . . . ,Xq with associated formal 
degrees 7^ di, . . . , dg G [0, 00) . Here, i^ G N is the number of parameters. Given the multi-parameter 
degrees, we obtain corresponding single parameter degrees, which we denote by ^ d and are defined by 
(X) '^)i '■— X]u=i "^^ — Mjli- Let 5 S [0, 00)'', and suppose we wish to study the ball 

B(x.d) {^0,5). 

Decompose 5 = 5o5i where 5q £ [0, 00) and Si e [0, 00)" (of course this decomposition is not unique). 
Then, directly from the definition, we obtain: 

B(x,d) (2^07 (5) = B(^s^x,j2d) (2:0, (5o) = Bi^sx.Y.d) (a^o, 1) • 

Thus, studying a ball of radius 5 corresponding to {X, d) is the same as studying a ball of radius 1 
corresponding to ((5X, ^d). For this reason, taking K^ and A as in Section [S] and assuming {X,d) 
satisfies V {Kq^A), we will fix xq G Kq and 5 £ A and study balls of radius « 1 centered at xq 
corresponding to the vector fields with single-parameter formal degrees [SX^^d). In what follows, it 
will be important that all of the implicit constants are independent of xq G Kq and 5 £ A. 

We now turn to stating a theorem about a list of C°° vector fields Zi^ . . . ,Zq defined on an open set 
17 C R", with associated single parameter formal degrees di, . . . ,dq G (0, 00). The special case we are 
interested in is the case when [Z,d] = {SX,^d); i.e., when Zj — S'^^Xj and dj = \dj\^. 

Fix xq £ CI and 1 > ^ > 0|f3 Let uq — dimspanjZi (xq) ,. . .,Zq (xq)}. For J — (ji, . . . ,:;„o) G 
X (no, q), let Zj denote the list of vector fields Zj^ , . . . , Zj^ . Fix Jo G I {no, q) such that 



det Zjo {xo) 

noxno 



det Z {xq) 

noXno 



where we have identified Z (xq) with the n x q matrix whose columns are given by Zi (xq) , . . . , Zq (xq) 
and similarly for Zj^, (xq). We assume iZ,d] satisfies C {xo,£,,fi). In addition we suppose that there 
are functions c*"' ■ on B/ „ r\ {xq, ^) such that 



[^^' ^j] == Yl ^tj^k, on B(^^_j^ {xo,0 



We assume that: 



• II^jIIc™(B(,,,)(.o.?)) <°o for every m. 

• E|a|<m ll^^^jllcors (^j,,^)) < oo- ^OT cvcry m and every i,j,k. 

We say that C is an m-admissible constant if C can be chosen to depend only on upper bounds for the 
above two quantities (for that particular choice of ?7i), m, upper and lower bounds for di, . . . , dg, an upper 

bound for n and q, and a lower bound for ^. Note that, in our primary example [Z,d] = {SX, ^ d), 

m-admissible constants can be chosen independent of xq G Kq and S E A. We write A <„i B ii A < CB, 
where C is an m-admissible constant, and we write A w,„ B ii A <m B and B <„ A. Finally, we say 
T = T (k) is an ?n-admissible constant if r can be chosen to depend on all the parameters an m admissible 
constant may depend on, and r may also depend on k. The next result is contained in Section 4 of 
[gfaTT] . 



^"In our primary example, one takes xq G Kq and ^ as in X" {Kq, A). 
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Theorem 11.1. There exist 2-admissible constants ?yi,^i > such that if the map $ : i?"° (?7i) — > 
^(zd) (^0,0 *'5 defined by 

we have 

• $ : B"" (rji) -^ ^(z d] (^OjO 's injective. 

• %,~)(a;o,a)C$(i3""(77i)). 

Furthermore, if we let Yj be the pullback of Zj under the map $, then we have, for m > 0, 

Il^jllc'"(i3"0(^i)) <mV2 1, (11-1) 

ll/llc"(B"0(,,i)) ~(m-l)V2 2^ II^"/llc0(B"0(,,i)) • (11-2) 

\a\<m 

Finally, 

det r (m) «2 1, Vw e B"° (771) . (11.3) 

Note that, in light of (|ll.ip and (|11.3p . pulhng back by the map $ ahows us to rescale the vector fields 
Z in such a way that the rescaled vector fields, Y , are smooth and span the tangent space (uniformly 
in any relevant parameters). We will also need the following technical result. 

Proposition 11.2. Suppose ^2,'72 > are given. Then there exist 2-admissible constants rj' — r/ (^2) > 
0, ^' = ^' (772) > such that, 

$(i?"«(^'))^%J) (2^0,6), 

i?(^,j)(xo,r)c$(i?"n^2)). 

Proof The existence of 77' can be seen by applying Theorem 111.11 with ^ replaced by '^A^2- The existence 
of ^' can be shown by combining the proof of Proposition 3.21 of |Strll) with the proof of Proposition 
4.16 of [Strll]. D 

Remark 11.3. With a slight abuse of notation, when we say m-admissible constant, where m < 2, we 
will take that to mean a 2-admissible constant. Using this new notation, the V in (|ll.ll) and (|11.2I) may 
be removed. 

Remark 11.4. One way to think about Theorem 1 11. II is from the perspective of the classical Frobenius 
theorem. Indeed, the main assumption in Theorem 1 11. II is essentially that the distribution generated by 
hy Zi, ..., Zq is involutive. One can view the map $ as a coordinate chart on the leaf passing through 
xq generated by Zi, . . . , Zq. Theorem 111.11 gives quantitative control of this coordinate chart. This is 
discussed in more detail in |Strllj . 

Remark 11.5. It is not hard to see that the single-parameter formal degrees d do not play an essential 
role in the above (see Remark 3.3 of |Strllj ). In fact, one could state Theorem Ill.H taking all the 
formals degrees dj = 1 and that would be sufficient for our purposes. Moreover, in every place we use 
the single-parameter formal degrees d, they are inessential. We have stated the result as above, though, 
to allow us to transfer seamlessly between the vector fields {X, d) and ( Z, d 1 , without any hand- waving 
about the formal degrees. 
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11.1 Control of vector fields 

In this section, we introduce a notion (closely related to Definition I6.3P to discuss when the list of vector 
fields with formal degrees "controls" a vector field with a formal degree (Xq, do). Informally, this means 
that if we were to add {Xo,do) to the list (X, d), we would not "get anything new." In particular, 
under the conditions we will define, we will not significantly increase the size of the balls Bi^x,d) {xq:5) 
(we would, instead, obtain comparable balls), and Theorem 1 1 1 . 1 1 will hold with the same choice of the 
map $ (when Theorem lll.il is applied to (JX, ^d)). Many of the results discussed in this section are 
discussed in more detail, and proved, in Sections 4.1 and 5.3 of |Strll| . and we refer the reader there 
for more details. 

Following in the philosophy of Section [TTl we will state our definitions and results for single parameter 

vector fields (Z,d\ near a fixed point xq €Vl, where one should think of the case (Z,d\ = {6X,J2d), 

where S (z A and xq G Kq (and all of the results and definitions hold uniformly in S and xq). We 

maintain all the same assumptions on [Z,d\ (and associated notation) as in Section [TT] In particular. 



Theorem 1 1 1 . 1 1 applies and we obtain 2-admissible constants ?]i and ^i and a map $ as in Theorem 1 11. II 
Suppose we are given a C°° vector field Zq with an associated single parameter formal degree do. 
Let (Z',d'] denote the list of vector fields with formal degrees 

( Zo,doj , {Zi,dij , . . . , iZq,dqj : 

the list of vector fields iz,d] with (Zo,(io) added. We introduce the following two conditions on 

I Zo, do ) which turn out to be equivalent. All parameters that follow are assumed to be strictly positive 
real numbers: 



1. Vi (ri, {an,neN) (^ < 6): There exist c, e C" (^B^^j^ (xo,ti)) 



such that: 



• ^0 = E|=i CjZj, on S(-2 j^ {xo, Ti). 

2. 7^2 (?72, WT}m£N) (^2 < ??i): Zq is tangent to the leaf passing through xq generated by Zi, . . . , Zq, 
and moreover, if we let Yq be the puUback of Zq via the map $ from Theorem lll.il we have, 

ll^o|lc-(B"o(,,2)) < <^T- 

Proposition 11.6. Vi <=> 7^2 in the following sense. 

• "Pi (ti, {cr™}^gpj) => there exists a 2-admissible 772 = V2{ti) > and m-admissible constants 
fj™ = a^ (ct5") such that V2 (j?2, {(7"},„gn) holds. 

• 7^2 (??2, {f^lmppj) ^ there exists a 2-admissible constant ti — ti (r/2) > and m-admissible 
constants (t™ = a^ (ct™) such that Vi (n, {o'T}m,eN) holds. 

Moreover, in the case that Vi (ti, {"""'ImgN) holds, we have the following. 

• There is a 2-admissible constant t' — t' ido,Ti,(Tf] , such that 

B[z,d) (2^0, t') C S^2,_j,) {xo,t') C B(^_j^ (a;o,Ti) . 

• The same map $ satisfies all the conclusions of Theorem \ll.l\ with iZ,d] replaced by [Z',d' 
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Proof. Vi ^ 7^2 follows just as in Proposition 4.19 of |Strllj (here Vi is related to Vs in jStrllj ). 
We now turn to V2 ^ Vi- Take n — ri (772) to be a 2-admissible constant so small that, 



B 



^zJ)i=^o,r^)^^{B"''{m)), 



by applying Proposition lll.2l Let Yi, . . . ,Yq be the puUbacks of Zi, 

and (jll.3|) . we may write, 

q 

Yo = Y.c,Yj, oni3""(^2), 



with. 



|a|<m 



j=i 



r-ll < 



^jllC'"(B"o(,,2)) 



, Zq via the map <&. Using (|ll.ip 

(11.4) 

< 4". (11.5) 



Pushing (|11.4I) forward via the map $ and combining it with the push- forward of (|11.5I) yields Vi- 

We now turn to the second part of the proposition: when Vi (or, equivalently, P2) holds, the above 
two conclusions hold. The first follows just as in Proposition 4.18 of jStrll] . For the second, note that 
in Theorem lll.il we chose Jq such that 



det Zj„ (a;o) 

noXno 



det Z (xq) 
no X no 



however, the theory from [Strllj shows that we only need 



> 



det Z (xq) 

noXno 



det Zj^ {xo) 

noXno 



Thus to show that the same map $ can be used for \Z',d'\, we need only show, 

no = dim span {Zq (xq) ,-.-,Zq (xq)}, 



det Z [xq) 

noXno 



> 



det Z' [xq) 

"oxno 



(11.6) 



D 



pr^ follows from Theorem 4.17 of [Strllj (the V^ ^ V^ part of Theorem 4.17). 

Definition 11.7. We say {Z,d\ controls ( 2'o,(io) at the unit scale near xq if either of the above two 
equivalent conditions {Vi or V2) holds. 

Definition 11.8. Let (z,d\ = fZi, diV . . . , fz,, d,^ and [z',d'\ = {z[,d'^ , . . . ,(z'^,,d'^}j be two 
lists of vector fields with single parameter formal degrees as above. We say [Z,d] controls (Z',d'] at 
the unit scale near xq if iZ,d] controls ( Z', d'- j at the unit scale near xq for every 1 < j < q' ■ We say 
[Z,d] and lZ',d'] are equivalent at the unit scale near xq if [Z,d] controls lZ',d'] at the unit scale 
near xq and iZ',d'] controls iZ,d] at the unit scale near xq. 



Proposition 11.9. // fZ, d) and {Z',d'\ are equivalent at the unit scale near xq, and if ^ is the 

map obtained when applying Theorem \11.1\ to the list iZ,d], then the same map $ satisfies all the 
conclusions of Theorem \ll.l\ with I Z', d' ) in place of iZ,d] (with possibly different constants). 

Proof. The proof is an easy addition to Proposition lll.6[ and we leave the details to the reader. D 
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Definition 11.10. Let Kq (e ^ and A C [0, 1]'^ be as in Section[5l and let {X,d) be a list of C°° vector 
fields with j/-parameter formal degrees as in Section [5l satisfying V{Ko,A). Let Xg be another C°° 
vector field with an associated formal degree 7^ do G [0,oo)'^. We say {X,d) controls {Xo,do) if for 
every S £ A, xq £ Kq, {SX, ^ d) controls (^6'^°Xq, |rfoli) at the unit sale near xq in such a way that the 
parameters of Vi (equivalently 'P2) can be chosen to be independent oi xq £ K., S £ A. 

Remark 11.11. Note that 'D{Kq,A) is equivalent to saying that {X,d) controls {[Xi,Xj] ,di + dj), for 
every 1 < i, j < q. 

Definition 1 1 1 . 1 01 is closely related to Definition l6.3l Indeed, we have the following proposition, whose 
proof we defer to Section 112.21 

Proposition 11.12. Let {X,d) be as in Definition \11.10\ Define, 

where each Va is a C°° vector field. Then, 7 is controlled by {X,d) if and only if (VQ,deg(a)) is 
controlled by {X,d) for every a. 

Definition 11.13. Let {X,d) = (Xi, di) , . . . , (Xi,d,) and {X',d') = {X[, d[) , . . . , {X'^, , d'^,) be two 
lists of vector fields with i/-parameter formal degrees satisfying V {Kq, A). We say {X, d) controls {X' , d') 
if {X,d) controls (X',(i') for every 1 < j < q' . We say that {X,d) and {X',d') are equivalent if {X,d) 
controls {X',d') and {X',d') controls {X,d). 

Remark 11.14. In SectionlHl we singled out a list of vector fields with formal degrees {X, d). Throughout, 
we will use these vector fields to study 7. In everything that follows, and because of the results in this 
section, it would work just as well to replace {X,d) with an equivalent list {X',d') in our applications 
of Theorem 111.11 

11.2 More on {X,d) 

Our assumptions in Section [6] above might make it seem that we have made some choices which affected 
the list {X, d). Specifically, we supposed that there existed a finite set .F C V such that T generated a 
finite list {X, d) and this list controlled 7 (here we are taking the notation from Section [6l in particular, 
V is given by (16.21) '). One might expect that {X,d) depends on this choice of J-. However, this is not 
the case: given such a T and given T' CV finite with T C J^', then any finite list {X' , d') generated by 
J-"' is equivalent to {X, d) |f^ This follows from the following lemma, and the theory in Section 111.11 

Lemma 11.15. Suppose {X,d) controls 7. Then, if Xa is as in (|6.ip . {X,d) controls (XQ,deg(a)). 

Proof. This follows easily from the definitions. D 

As is discussed in Section flLli in light of Lemma [11.151 one does not "get anything new" by adding 
{Xa,deg{a)) to the list iX,d). 

Because of the above, there is a more direct way to construct the list {X,d). Fix M large. One 
considers the set 

J^ = < (Xq, deg (a)) : deg (a) is non-zero in only one component, \a\ < M >. (11-7) 

One then recursively assigns formal degrees to commutators of vector fields of J' as follows: if Xi has 
formal degree di and X2 has formal degree 1^2 , then [Xi , X2] is assigned formal degree di+d2. One then 
takes iterated commutators of the elements of J-', and we assume that, after taking all iterated commu- 
tators up to order M, the resulting finite list of vector fields with formal degrees satisfies T> {Kq, A), and 
controls 7. The above discussion shows that if this is true for M, it is true for any larger choice of M, 
and the resulting finite lists of vector fields with formal degrees are equivalent. 



^^As already noted in Definition (53] given J-", all choices of {X,d) are equivalent. 
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12 Surfaces revisited 

In this section, we introduce the technical resuhs necessary to deal with our assumptions on 7 in Section 
ini Before we enter into details, we point out the formal connection between 7 and W. Recall, from 
Section [SI W {t,x) is defined by. 



H^(M) = | 



letoji'ix). (12.1) 



From this we see that, given 7, we obtain a C°° vector field W (t), depending smoothly on t, satisfying 
W (0) = 0. The reverse is true as well: 

Proposition 12.1. The map ^ 1^ W is a bijective map from, smooth Junctions ^t, with 70 {x) = x 
(thought of as germs in the t variable), to smooth vector fields depending on t, W {t), with W^ (0) =0 
(also thought of as germs in the t variable). 

Proof. Let lo (e, i, x) be the (uniquqfj) solution to the ODE: 

—oj{e,t,x)^-W{et,uj{e,t,x)), w (0, t, x) = x. (12.2) 

de e 

We will show that the (two-sided) inverse to the map 7 H> VF is given by 7^ {x) :— w (1, i, x)o First we 
show that 7ct [x) = uj (e, t, a;), where lo corresponds to W obtained by (|12.1|) from 7. Indeed, consider. 



d_ 
~dk 



let [x) = — 
e— Co 



— 7e(eot) {x) = —W (eot, 7eot (x)) , 
£=1 ^0 eo 



where in the last equality, we used the definition of W. Thus 7^4 [x) satisfies the ODE (|12.2p . Hence, 
7* [x) = UJ (1, t, x), and we may, therefore, reconstruct 7, given W. We therefore have constructed a left 
inverse to the map 7 n- W^. 

Now we must show that if W is as in the statement of the proposition, and if we let w be the solution 
to the ODE IT^ . then, 

— uj{l,et,x) = W{t,uj{l,t,x)). (12.3) 

d^ e=l 

I.e., we must show our left inverse is also a right inverse. (|12.3p will follow from (|12.2p if we can show, 
u! (1, et, x) = Lo (e, t, x). This, in turn, will follow if we can show for every eo, uj (eoe, t,x) = uj (e, eoi, x). 
We already know that uj (e, eoi, x) satisfies the ODE (|12.2p with t replaced by eoi. Thus, we need only 
show the same is true for uj (eoe, t, x). However, 

— a;(eeo,t, x) = eo — M^ (eeoi,a; (eeo, i, x)) = -W^ (eeot, w (eeo, t,x)) . 
de eeo e 

Since uj (Oeo,i,a;) — x, by definition, this completes the proof. D 

The correspondence between 7 and W given by the proof of Proposition 112.11 will be an essential 
point in what follows. 

Now we turn back to the perspective used in Section [TTJ fixing 6 and xq and working with {SX, ^ d) 
near xq. Let V. C M" be open, and let H.' <b H." <s H. be open, relatively compact, subsets of ft. 
We assume we are given a list of C°° vector fields with single-parameter formal degrees [Z,d] = 

(Zijdij , . . . , (Zq.dqV dj G (0, 00). We fix xq G fl' and assume that iz,d] satisfies all of the assump- 
tions of the vector fields and formal degrees of the same name in Section [TT] (and therefore we are given 
some 1 > ^ > 0). We also assume that we are given a C°° function 

7(i,x) :B^(p) X n" ^n, 



''^It is easy to see that the solution is unique, using that W (0) = 0, W is smooth, and the integral form of Gronwall's 
inequality. 

^^It is easy to see, via the contraction mapping principle, using the fact that W (0) = and W (t) is smooth, that the 
solution of III2.2II exists up to e = 1 for t sufficiently small. 
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where p > is some fixed positive number, and 7(0,x) = x. Here p > is small enough that for 
t e B^ (p), 7t"^ exists. 

For the primary example where we will apply the results from this section, take the same setup 

as in Section [ini Let j,k G — log2 ^ and set jo — j A k. We will take [Z,d] = (2^^"X,^d) and 
7t = 72-Jot- It is essential that all of the constants in this section can be chosen to be independent 
of xo G Kq and j, fc G — log2 A. Note that our assumption on the list {X, d) in Section [5] implies that 

[Z,d] = (2~^°X, Y^ d) satisfies all of the assumptions of Section [TT] uniformly in the above parameters. 

Thus, m-admissible constants (as defined in Section [TT|) can be chosen to be independent of the above 
parameters. In particular, the results of Theorem 111.11 hold uniformly in the above parameters. 

We let no,r]i,^i > 0, and $ : S"" (771) — > B/^^\ (xq,^) be as in Theorem lll.il Furthermore, we let 
Yi, . . . ,Yq be the puUbacks of Zi , . . . , Zg as in Theorem 111.11 In order to work with our assumption 
that (X, d) controls jt (see Definition 16.31) , we introduce the following two conditions on 74 , which will 
turn out to be equivalent. All parameters that follow are assumed to be strictly positive real numbers: 



1- Qi (pi,Ti, WT}mei'i) (/'I ^ Pi ■''1 ^ Ci)- Foi" ^ ^ ^'1 define the vector field, 

7,, 07-1 (a;). (12.4) 



W{t,x)^^^ 



We suppose, 

• W{t,x) =J2LiCi{t,x)Zi{x), on B(2,|) {xo,ti), 

' ' '- C'°(S"(pi)xB(2,d)(^0,ri)j 

• Note that we may, without loss of generality, assume that ci (0, x) = 0, as we may replace 
ci {t, x) with Q (i, x) — ci (0, a;) for every I by using the fact that W (0, x) = 0. 

2. Q2(p2,T2,{a2"}„gpj): 

. 7 (i?^ (p2) X B^^j^ ixo,T2)) C B^^^^^ (a;o,6), 

• If 77' = 77' (T2) > is a 2-admissible constant so smalrl that, 

$ (B"o ir,')) C i?(^,^) {xo,T2) C $ (i?"o (^^)) , 

then if we define a new map, 

Ot (u) = $-1 o 7t o $ (u) : B^ (p2) X B"° (77') ^ B"° (771) , 
we have, 

ll^llc'"(S«(p2)xB"o(^')) < cr™- 

Remark 12.2. The particular parameters in Qi and Q2 are not important. All that is important is that, 
in our primary application, they may be chosen independent of j, k and xq. At times we will keep track 
of the parameters to make clear the interdependence between the various constants, but the reader may 
wish to ignore these details on a first reading. 

Proposition 12.3. Qi <;=> Q2 in the following sense: 

• Qi (Pii ''"1' {"'"ImGN) ^ there exists a 2-admissible constant p2 = P2 {piiTi,a\,N\ and m + 1- 
admissible constants cr™ = cr™ i'^T^ ' ^) such that Q2 (p2, -y, WT} mefi) Isolds. 

• Q2 {p2,T2,{a™}^i-j^) ^ there exists a 2-admissible constant ti = ri (T2) > and m-admissible 
constants ct™ = tr™ {'^T'^^ ^ ^) ' ■s'^^/i that Qi (p2,n, {c^l^gN) /lo/ds. 



^^Such an rj' exists by Proposition 1 11.2] 
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Proof. Suppose Qi (pi,Ti, WT}meN) holds. Fix p2 > small, to be chosen in a moment. Suppose 

and so there exists to G B^ (^2) and y G B/^ j\ (xq, -y) such that z = 74^ (y). We wish to show that 
z € B/^ j\ (xo,Ci) (by appropriately choosing ^2)- To do so, it suffices to show z e Br^g\ {y, ^), as 
then we would have z G Br^ j\ [xq, ti) C Br^ j\ {xq, fi). Define a curve k : [0, 1] — ?> il by 

'^ (s) = isto (y) ■ 

Note that k{0) = y and k (1) — z. Also, 



.1 9 1 

— K (s) = -Ty (sio, '« (s)) = y^ -Q (sio, «; (s)) ^; {n («)) . 



1=1 



Sincqf^ q {sto, k (s)) =0, the mean value theorem shows that, for < s < 1, 



-Q (sto,K(s)) 
s 



< sup 

0<s<l 



— C; (sioiKCs)) 

as 



< sup 

0<s<l 



< 



1*0 • VtQ (sio,'«(s))| +^ 
(=1 



-ci {sto)Zici ) (sio,'«(s)) 



where we have used that iq {sto, ■) < crj^, and in the last line, we are thinking of q (sio) Zici as the 
vector field q (sto, •) Z; (•) acting on the function ci (sto,-). 

Thus, j^k{s) — '^'i^idi (s) Zi {k{s)), where \ai\ < |to|. By taking |to| sufficiently small (i.e., by 
taking p2 sufficiently small), we may insure that z = k (1) G Br^ ^\ (y, ^) . 

Hence, it makes since to define rj' > and 9t as in the definition of Q2. It follows directly from the 
definition of 9 that 9o (u) = u. Furthermore, pulling back (112. 4p via the map $ shows that. 



de 



o9^\u) = W{t,u), 



(12.5) 



e=0 



where. 



W{t,u)=Y,ci{t,u)Yi{t 



1=1 



Cl (t, u) = Cl (t, $ (u)) . 



We claim, 



W 



C"'(S"(p2)xB"o(,,')) 



< c„ 



(12.6) 



where C„i = Cm (c™) is an m-admissible constant. Indeed, we already have the result for Yi by (111.11) 
Hence it suffices to bound the C™ norm of c;. However, applying (a slight modification of) (|11.2p r^l 

c"{B«{p2)xB"o(n')) ■ 



\a\ + \p\<m 



(12.7) 



Note that. 



Y' 



'afQ(t,«) = (z"afQ)(t,<i>(u)). 



^^As remarked above, we may assume c; (0, x) = 0. 

^^This slight modification can be proved easily by combining mi. Il l and 1111.31 1. 
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Combining this with the fact that, 
we have from (|12.7p . 



Q| + |/3|<m 



Co(s"(pi)xi3(2,j)(2;o,ri)) 



<a^ 



completing the proof of (|12.6p . 

To complete the proof of Qi ^ Q2, we bound the C™ norm of 0. For t G B^ (P2), u £ 5"° (f?')i ^-i^*^ 
e £ [0, 1], define a function, 

Lu {e,t,u) = 9 {et,u) . 

In light of (I12.5p . uj satisfies the ODE in the e variable: 

-—OJ (e, t, u) = —W (et, LU (e, t, u)) , w (0, t, u) = u. 
de e 

Using that W (0, u) = and that W is smooth, standard theorems for ODEs show that there exists 
cr™ = a^ {Crn+i,m, N) such that, 

l!'^llc'"([Oa]xS"(p2)xS"o(r,')) - '^T- 

Since 61 (t, u) = a; (1, i, m), this completes the proof of Qi =^ Q2- 
Suppose that Q2 (^2,^2, {f^lmgrj) Isolds. Define, 



iy(i,u) = - 



Jet " i^t 



(«)• 



e=l 



Then, 



W 






C'-(B"(p2)xB''0 (,,')) 

where Cm depends only on m. Using (lll.ip and (|11.3p . we see that, 

W {t,u) =^Zi{t,u)Yi{u) , 



(12i 



i=i 



and, 



IqII 



< r" 



where C^ = C^ (^™^^) is an m-admissible constant. Applying, again a slight modification of, (|11.2I) . 
we have, 

y] |k"afQ ~™-l ||Q||c...(S«(p2)xi3"0(„M)<C;,. (12.9) 

^-^ II CO(S"(p2)xB"o(,,')) "<-■ ^" (P2)X_« mr, )) 

[Q| + [;3|<m 

Using Proposition 111.21 let t\ = ri (77') > be a 2-admissible constant such that. 



i?(^,j)(a;o,ri)C$(B»o (,/)), 



Pushing (112. 8p and (|12.9p forward via the map $, and taking W as in the definition of Qi, we see for 
X e $(B"" (?7')) andQ(f,a;) = q (i,$-i(a;)). 



W (t,x) =^ci{t,x)Zi{x) . 



i=i 






Co(B"(p2)xS(^^^)(:ro,ri)) 



^m-1 C*™ 



completing the proof. 



D 
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Definition 12.4. We say [Z,dj controls 7 at the unit scale if either of the equivalent conditions Qi 
or Q2 holds (for some choice of the parameters). If we wish to make the point a;o explicit, we will say 
[Z,d] controls 7 at the unit scale near xq . 

Proposition 12.5. Let {X, d) and 7 be as in Section\^ {X, d) controls 7 if and only if {SX, ^ d) controls 
"fit it the unit scale near xq for every 6 € A and Xq G Kq, and the parameters of Qi (equivalently Q2) 
can he chosen independent of Xq G Kq and 5 £ A. 

Proof. Let 5 (z A and Xo G -ft'o- Define 7^ — jst- Note that if W{t,x) is as in Definition 16.31 and if 
W {t, x) is as in Qi, then, 

W{t,x) = W{5t,x). 

Now the result follows just by comparing Qi and Definition 16.21 □ 

In light of Proposition 112.51 to work with our assumption that (X, d) controls 7, we may instead 
work "at the unit scale." For the rest of this section we work in this perspective, with the understanding 
that Proposition 112.51 allows us to transfer our results from the "unit scale" to "every scale." 

Proposition 12.6. If (Z,d] controls j} andjf at the unit scale, then (Z,d] controls^} ojf at the 
unit scale. If iZ,d] controls 7* at the unit scale, then [Z,d] controls ^^ at the unit scale. 

Proof. Both of the above statements arc more easily verified using Q2 . Indeed, the C™ norm of, 

$-1 o 7,\ o 72^ o $ = ($-1 o 7,\ o $) o ($-1 o 72^ o $) 

can clearly be bounded in terms of the C" norms of, 

$"^ o ^l^ o $ and $"^ o 7(2^ o $, 

provided one shrinks the parameters p2 and T2 appropriately (so that the composition is defined). A 
similar proof works for 7^"^. We leave further details to the interested reader. D 

Proposition 12.7. // (Z, d) controls jt o,t the unit scale, and c — (ci, . . . , cjv) G [0, 1] is a constant. 



then [Z,d] controls, 

j" ((ii, . . . , tN) ,x) := 7 ((ciii, . . . , CNtN) , x) , 

at the unit scale. Moreover, the parameters in the definition of control at the unit scale may be chosen 
independent of c. 

Proof. This is obvious from the definition of control (using either Qi or Q2). D 

So far in this section, we have covered the necessary technical details to deal with our assumption 
that {X, d) controls 7. The other part of our assumption was that the vector fields with formal degrees 
{Xj,dj) were generated by vector fields with formal degrees of the form {Xa,deg{a)), where t"Xa 
was a term in the Taylor series of W {t,x), when taken in the t variable, and deg (a) was nonzero in 
only one component. We now turn to the various definitions and theorems necessary to deal with this 

assumption. Again, we work at the unit scale. We continue to take I Z, d) , 74 (x), and xq as above. We 
assume, throughout the rest of the section, that ( Z.d] controls 7 at the unit scale (near xq). 



Definition 12.8. Let {Zi,..., Zr} be a subset of {Zi, . . . , Zg}. For M G N, we say Zi, . . . , Zr M- 
generates Zi, . . . , Zq if each Zj {r + I < j < q) can be written in the form, 

Zj=ad(Z,Jad(Z,J---ad(Zj,jZ,„^^,, 0<m<M-l, I < h < r. 
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Define W {t, x) as the vector field, 



W (t, x) 



de 



%t o It ^ (2;) 



Let Za (x) be the Taylor coeSicients of W {t, x) when the Taylor series is taken in the t variable: 

T?(i,x)- Y^ eZo^ix). 

q|>0 

Definition 12.9. For Mi,M2 G N, we say 7 Mi,M2 generates Zi,...,Zq if there is a subset J' C 
{Za ■ |a| < Ml} n {Zi, . . . , Zq}, such that T Af2-generates Zi,..., Zq. 



Since we are assuming that [Z,d] controls 7 at the unit scale, we may pull 7 back under the map 
$ as in Q2 to obtain the C°° map: 

9t {u) = $-1 o 7t o $ [u) : B^ (p2) X B"« (r/) ^ B"« (r,) , 

as in Q2. As usual, define Fi, . . . Yg to be the pullbacks oi Zi, . . . , Zq under the map $. Define, 



W{t,u) = - 



(u). 



e=l 



Let Ya (u) be the Taylor coefficients of W expressed as a Taylor series in the t- variable: 

\a\>0 

It is immediate to verify that Ya is the pullback of Za under the map <&. 
Theorem 12.10. Suppose 7 Mi,M2 generates Zi, . . . , Zq. Then, the set of vector fields 

{Ya : \a\ < Ml} , 



(12.10) 



satisfies Hormander's conditions of order M2 on _B"" (77'). More specifically, if we let Vi, . . . Vl denote 
those vector fields in (|12.10p along with all commutators of those vector fields in (|12.10l) up to order 
M2, then. 



det V{u] 

noX"o 



>2 1, yue B^° (77') , 



where we have written V to denote the matrix whose columns are Vi, . . . ,Vl- In short, 9 satisfies (Cz)- 

Proof. By assumption a subset of {Za '■ \a\ < Mi} M2-generates Zi, . . . ,Zq. Pulling this back under 
<& we see that a subset of (|12.10p Af2-generates Yi,...Yq. I.e., each of Yi,...Yq appears in the list 
Vi,...Vl. Thus, 



det V{u] 

ngxno 



> 



det Y{u) 

"oxno 



>2 1, Vm e B"" (?/ 



In the last inequality, we have used (|11.3p . 

We now combine the above discussion to obtain the main results of this section. 



D 



Theorem 12.11. Suppose Zi,...,Zr M2-generates Zi,..., Zq. Let 7' (t, x) : B^ (p) x Q" -^ n, I 
1, . . . ,k, be a sequence of k C°° functions, such that each 7' is controlled at the unit scale by [ Z,d 
Let 



W,it,x) = - 



let ° (7* ) (a;) , 
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and define the vector fields Vj^a to be the Taylor coefficients (in t) of Wj : 

W,{t)^Y.^'^V,,^. 

Suppose there is Mi G N such that each Zs (^ < s <r) appears as some Vj^a for \a\ < Mi. Define, 

7(ti,...,tO (a;) = 7t\ ° 7t^2 ° • • • ° ^t (^) ■ 

Then, 7 is controlled at the unit scale by iZ,dj. Moreover if we take 9 as in Q2, then 9 satisfies 

(Cz), where the constants in the definition of [Cz) are 2- admissible constants depending on the above 
parameters (Mi, -M2J. 

Corollary 12.12. Take the same setup as Theorem \12.11[ and, more precisely, assume eachj'' satisfies 
Q2 (p2, '''2, {c^l^gpf) . Then, 9 satisfies (Cj) (at 0), where the constants in the definition of (Cj) are 
2-admissible constants depending on the above parameters (Mi, M2, k, p2, T2, {(^2^} men) ■ 

Proof of Corollary \1 2. lS\ given Theorem ] 1 2. 11[ Letting 7^,..., 7*^ vary over all choices satisfying the 
hypotheses of Corollarv ll2.12l (for a fixed choice of the parameters), Theorem 112.111 shows that the set 
of all such 9, so obtained, satisfies (C^). Since {o'2^}„ieN ^^ fixed, the set of all such 9 forms a bounded 
subset of C°°. The corollary now follows by applying Theorem 19.41 which states that (C^) ^ (Cj) for 
bounded subsets of C°°. D 



*^<" = s 



Proof of Theorem \12.11\ That 7 is controlled at the unit scale by [Z,d] follows from Proposition ll2.6l 
Define, 

7.*°7r'~ E ^"^- 

1 |q|>0 

In light of Theorem ll2.101 we need only show that each Zi, . . . , Zr appears as some Za with |q| < Mi. 
Since, 

it is easy to see that, 

t? (0, • • • , 0, tj,0, • • • 0, x) = Wj {tj,x) . 

Since each Zi, . . . ,Zr appears as a Taylor coefficient of some Wj (with \a\ < Mi), the same is true for 
W . This completes the proof. D 

Though we will work in slightly greater generality in the sequel, it is worthwhile to take a moment 
and see the main consequence of our assumptions on 7 in Theorem 17. 2[ in light of Corollary 112.121 

Proposition 12.13. Let {X,d) and 7 be as in Theorem \7.2\ i.e., {X,d) and 7 satisfy all of the as- 
sumptions of Section\^ Fix xq G Kq and 5i,52 G A. Let 5o = 5i\/ S2 (Sq E A by our assumption on 
A). Define, 7 as either, 

7(ti,t2,t3,u) i^) = 7^;^ ° ^S2H ° Is2t2 ° ^-51*1 (2^) ' (12-11) 

or, 

l{tut2,UM) i^) = 7<5it4 ° 7^2*3 ° l&2t2 ° l5^]^ i^) ■ (12.12) 

Then, (SqX, ^ d) controls 7 at the unit scale near xq (and the parameters in the definition of control 
may be chosen independent of 5i, S2, and Xq). Moreover, if we define 9t {u) as in Q2, then 9 satisfies 
(Cj) (at 0) uniformly in 61, 62, and xq. More specifically, if we let S be the set of all such 9 (as Si, 62, 
and Xq vary) then S satisfies (Cj). 
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Proof. Let (Z,d] = {SoX,'^d). By Proposition 112.51 jSot is controlled at the unit scale by {Z,d). 

Proposition 112.71 then shows that ^Sit and js^t are controlled by [Z,d] at the unit scale. Proposition 

I12.6l applies to show that "y^\, 77 1' ^^^ 7 ^'^^ ^^^ controlled at the unit scale by (Z,d) (uniformly in the 
relevant parameters). The proof will then be completed by combining Corollarv ll2. 121 with the following 



lemma. 



Lemma 12.14. Taking the same setup as in Proposition \12.T^ and letting [Z,d] = {SoX,'^d). 

There is a subset Zi, . . . , Zr of Zi, . . . , Zq such that Zi, . . . , Zr M2-generates Zi, . . . , Zq. Moreover, 
each Zi, . . . , Zr appears as a Taylor coefficient (in t) of either 



de 



75i(et) ° Is,] i^) or Ws^ {t, x)^ — 



l5^,{et)0 1l,.^{x), 



and this Taylor coefficient may be chosen to be the coefficient oft"', where \a\ < Mi. Here, Mi,M2 G N 
may be chosen independent of 5i, 62, and xq. 

Proof. First we recall the definition of {X, d). We wrote, 

d 



Wit,.) ^^ 



7,-,! 07,(2;)^ ^ t"X„. 

|a|>0 



We took a finite subset of 

{{Xa, deg (a)) : deg (a) is nonzero in only one component} , 

denote this finite set by (Xi, di) , . . . , (X^, dr), and assume each [Xi, di) , . . . , (X^, d^) appears as (X^, deg (a)) 
for some |a| < Mi. Take Zi, . . . , Z^ to be 5'^^Xi, ..., d"^^ X,.. 

We assumed that [Xi, di) , . . . , {Xr, dr) generates the finite list {Xi, di) , . . . , {Xq, dq). That is, each 
{Xj,dj) can be written in the form, 

X, -ad(X,Jad(X,J...ad(X,„_,)Xz„, 
dj = di^ + di^ -\ + dj,„, 

where 1 < Z^ < r for each k, for some 1 < m = m{j). We choose M2 G N so that m < M2 for every j. 
Note that if we take (z, d) = {SqX, J2 d) then (|12.13p shows. 



Z, = ad (Zi J ad (Zi,) • ■ • ad (Z;„._ J Z,„, 

for the same choice of ^i, . . . , /„• Hence, Zi, . . . , Zr M2-generates Zi, . . . , Zq. 

To complete the proof, we must show that each Zj (1 < j < r) appears as a Taylor coefficient of 
either Wg-, or Ws2- Fix j (1 < j < r), and suppose that dj is nonzero in only the fi component (by 
assumption we know it is nonzero in only one component). Since Sq — (5i V(52, 5^ G {(5^, d!^}. We suppose 
that 6q = S'^, the other case is similar. We will show Zj is a Taylor coefficient of Wg-^. Indeed, 

Ws, it,x) = WiSit,x) ^ J2 i"^?°^^"^^«- 

|a|>0 

Take |a| < Mi such that {Xj,dj) = {Xa,(ieg{a)). Since dj is nonzero in only the jjl component, we 
have, 

rdog(a) ^ _ c-dj ^ ^ X^-i V _ 7 

Oi ^^OL — ^1 ^^1 — ^0 i — ^j^ 

completing the proof. D 

Remark 12.15. Lemma 1 12. 141 is the only place where we use that {Xi, di) , . . . , (Xr, dr) were chosen with 
each dj nonzero in only one component; i.e., that {X, d) was generated by "pure-powers." Note, though, 
that this is essential to Lemma [12.141 
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12.1 Multi-parameter curvature conditions 

Throughout the paper, we work mostly with the curvature conditions (Cz) and (Cj). Indeed, using the 
notation in Proposition ll2.l31 the essence of SectionfT^was that 6 satisfies (Cj) uniformly in any relevant 
parameters (and this followed by showing that it satisfied {Cz) uniformly in any relevant parameters). 
While (Cz) seems to be the best suited to theoretical applications, (Cg) is perhaps more intuitive. The 
reader may, therefore, wonder if there is an analog to Theorem 19. II in our setup. Indeed, there is, and 
this section is devoted to discussing this fact. The results in this section will not be used in the sequel 
(except briefly in Section fl 2. 3p . and the reader may safely skip this section on a first reading. 

The following curvature conditions, we think of as being "scale invariant." We, therefore, denote 
them by (Cg), (Cf ), and (Cf) (we discuss (Cj) in Remark Fl 2. 17p . The condition (C|) is precisely the 
condition assumed on 7 in Section |6l We restate it here. In what follows, 7 {t,x) : B^ (p) x il" —> il is 
a C°° function with 7 (0, x) = x, where il" and fl are as in Section [6l 



(C|): Define, 



»'(«-| 



7et O 7j 1 (x) . 



e=l 



Write W (t) - Y.a *"^», and define 

V = {{Xa, deg (a)) : deg (a) is nonzero in only one component} . 

(C^) states that there is a finite subset J^ C V such that J^ generates a finite list and this finite list 
controls 7. Note that both "generates a finite list" and "controls" in the above depend implicitly 
on the set A C [0, 1], the chosen compact set Kq (e 51", and the multi-parameter dilations e. 



(Cg): Write 74 ~ exp (X]u|>o^""^") • (^b) ^^ *^*^ same as (C^) except V is replaced by 

V = < ( Xa, deg (a) j : deg (a) is nonzero in only one component > . 
(Cf ): For each j, I < j < N, define. 



iy,(.)^.7(^,7r^(.))(J-) = ^ 



Jt+s o 7t ^ {x) . 



s=0 



Write Wj {t) ^ J2a ^^^aj- (Cy) is the same as (Cf ) except V is replaced by 

V ~ {{Xa.j,deg (a) + bj) : deg (a) + Cj is nonzero in only one component} , 

where e denotes the chosen multi-parameter dilations. 

Proposition 12.16. (q) ^ (Cf ) ^ {C'z)- 

Proof sketch. Using the Campbell-Hausdorff formula, the vector fields Xa can be written in terms of 
the vector fields Xa (respectively, Xa.j)- This can be seen just as in Proposition 9.6 of jCNS W99] . It 
is not hard to see, using the form of this correspondence (which we have not made precise), that (C^) 
and {Cp) are both equivalent to (C^). D 

Remark 12.17. Under the equivalent assumptions (Cf), (Cg), and (Cp), the assumptions of Proposition 
ll2.13l hold. The conclusion of Proposition ll2.13l can be seen as (C}). Note that each of (Cf ), {Ct), and 
(Cy) comes with an intrinsic family of vector fields from which we can construct the scaling map ^xo,s 
(for each 5 € A, xq G Kq), which we use to make sense of (C}). There is no such intrinsic family of 
vector fields corresponding to (C}), and we therefore cannot discuss any sort of reverse implication in 
this sense. Of course, if we fix a choice of the list of vector fields {X,d), then all four conditions are 
(in a sense) equivalent, since the pullback of them under $ corresponds to (C^), (Cg), (Cy), and (Cj), 
respectively (which we know to be equivalent). More specifically, if we define 9f°' = ^~^ g o jst ° ^xo,s 
then {9^<''^ : xq G Kq, 5 e A} satisfies (C^), (Cg), and {Cy), respectively, if and only if 7 satisfies (C|), 
(Cg), and {Cy) respectively. We then say that 7 satisfies (C}) if {O'^o-^ : xq G Ko,5 e A} satisfies {Cj). 
This idea can also be used to provide another proof of Proposition 112.161 
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12.2 Proof of Proposition 111.121 

In this section, we exhibit the proof of Proposition 111.121 Let iz.d) be vector fields with single- 
parameter formal degrees satisfying the assumptions in Section [TT] for a fixed xq e fi; i.e., the same 
setup as in Section[T2l For each multi-index a with \a\ < L, let Va be a C°° vector field. Though it will 
not play an essential role in this section, assign to Va the formal degree |deg(a)|-^. Let % be given by, 

J^ (x) = e^o<i°i<-f'*°^°a;. 

Proposition 111.121 will follow immediately by applying the following lemma with iz,d] = {SX, ^ d) , 
with 5 Cz A, and xg G K , and taking % = "fst- 



Lemma 12.18. lZ,d] controls 7 at the unit scale near xq if and only if iZ,d] controls (Va, |deg (a)|]^) 

at the unit scale near xq, in the sense that if j satisfies Q2 then {Va, |deg(a)|j^) satisfies 1^2 and the 
parameters of V2 can be chosen to be 2-admissible constants depending also on the parameters of Q2. 
Conversely, if (Va, |deg (a)|]^) satisfies V2 for each a, then 7 satisfies Q2 and the parameters for Q2 can 
be chosen to be 2 admissible constants depending also on the parameters for V2- 

Proof. Let C/q be the puUback of Va via the map <&, and let 9t (u) = $^^07(0$ (u). Note that, 
Then, the statement of the lemma can be restated as saying, for some p2, ?/2 > 0, 

if and only if, 

This is well known. D 

12.3 T* is of the same form as T 

In this section, we prove the following proposition. 

Proposition 12.19. IfT satisfies all of the assumptions of Theorem \7.S[ then so does T* , where T* is 
the L^ adjoint of T . 

Recall, T is of the form, 

T if) (x) = Vi (x) I f [it {x)) ^2 {it [x)) K (i, x) K (t) dt; 



see Theorem 1 7. 2 1 for further details. Since we are assuming K (t) has small support (and we are allowed 
to choose how small, depending on 7), we may without loss of generality assume that det ^ it,x) > i 
on the support of K (since det ^ (0, x) = 1). Note that, 

T* (/) (x) = iJ^)J4'i{lt'{x))n{t,i-\x))f {1-' [x)) (^det g {t,i;' (x))) K [t) dt 
= ^hix) j i;,{i^^{x))K {t,i-' (x)) f {1-' {x))K{t) dt, 

where k {t, x) = k {t, l^^ (x)) (det ^ {t, if^ (x)) j . From here it is immediate to see that Proposition 
112.191 follows from the following lemma. 

Lemma 12.20. it satisfies all of the hypotheses of Theorem \7.°^ if and only if ii does. 
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Proof. We show that jt satisfies (C*) if and only if 7^ ^ satisfies (C^) , and the result will follow from 
Proposition 112. iBl Since the result is symmetric in 74 and 7(~ we show only one direction. 

Suppose 7t satisfies (C*). Thus we assume that our hst of vector fields {X,d) is generated by 

vector fields of the form ( X^, deg (a) 1 , where, jt (x) ^ exp ( X]|q|>o *"^a ) x. However, Lemma 9.3 of 
|CNSW99] shows, 7^"^ (x) ^ exp ( — J2\a\>o ^"^a ) x. Hence we may use the same list of vector fields to 
show that 7(~^ satisfies (Cg). The result will now follow once we show (X, d) controls 7^"^. Rephrasing 
this, it suffices to show that 7^"^ [x] :~ 7^^ {x) is controlled by {Z^d\ :— {5X,^d) at the unit scale 
near a;o, uniformly for xq G Kq and 5 E A. However, our assumption on 7 can be rephrased as saying 
that 7f is controlled by [Z,d\ at the unit scale near xq , uniformly for xq G Kq and ^ G .4. The result 
now follows from Proposition ll2.6l D 

13 The space L] 

In this section, we review the results from Sections 7 and 13 of [CNS W99] . concerning the space Lg 
(which is defined below). The situation we are interested in is as follows. ^P will be a C°° mapping on 
the closure B of a finite ball B in W^ mapping to R", with d > n. Consider a measure ^ (r) dr in W^, 
where ip ^ Cq [B). We define a measure /i on R" by the integration formula. 



fiv) dfiiy)= /_/(*(r))V(r) dr; 
Jb 

that is, II is the transported measure dfi — ^, (tpdr). 

The goal of this section is to discuss the fact that, under appropriate conditions, /i is absolutely 
continuous with respect to Lebesgue measure and the Radon- Nikodym derivative h (y) = -^^ possesses 
the following level of smoothness. 

Definition 13.1. For < (5 < 1, Lj (R") is the Banach space of all functions / G L^ (R") such that, 

\h{y^z)-h{y)\dy<C\zf, Vz G R". (13.1) 

The norm on L^ is defined to be ||/i||^i plus the smallest C for which p3.ip holds. 
Proposition 13.2 (Proposition 7.2 of |CNSW99] ) . Suppose that, for some multi-index a, 



9 v. 5*/ ^ 

— det^— r 

dr / nxn Ot 



y^ 0, Vr G B. (13.2) 



Then the transported measure dfi = ^* [ipdr) is absolutely continuous with respect to Lebesgue measure, 
and its Radon-Nikodym derivative h belongs to L\ for all 5 < [2 \a\) . Moreover, the L\ norm of h can 
be bounded in terms of an upper hound for the C'"'"*"^ (S) norm of ^ , a lower bound for the left hand 
side of (|13.2p in B, an upper bound for the C^ norm ofip, the number S, and an upper bound for \a\. 

Fix constants Ci,C2 < 00 and let ( G (0, 1]. Consider a non-negative measure S on R^" with the 
following properties. 

. supp S C {(y, z) : \y\ , \z\ < Ci, \y ~ z\ < CiC}. 

• There exist bounded, nonnegative, measurable functions rrii, 7712 such that for every f £ C''^ (R"), 

/ (y) dE {y, z)^ f (y) mi (y) dy, 

f (z) dE {y, z)^ I f (z) TO2 (z) dz; 

with, 

TOi (y) ,m2 (z) < C2. 
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Proposition 13.3 (See Proposition 13.1 of |CNSW99] ) ■ Suppose h E Ll (M") and E is a measure as 
described above. Then, there exist 5' , A G (0, oo) such that, 



\h[y)-h{z)\ dE{v,z)<AC' \\h\\ 



Li' 



where S' depends only on S and n, and A depends only on S, n, and upper bounds for the constants 
Ci , C2 . 

Proposition 113.31 is slightly different than Proposition 13.1 of |CNSW99] . Superficially, we have 
renamed 2~'^ in ^CNSW99] to be ( here, and taken j and x in |CNSW99] to be 0. Furthermore, we have 
removed any aspect of the "lifting" procedure from |CNSW99] and we have replaced the quasi-distance 
d with Euclidean distance. Finally, we need not assume Ci small as is done in |CNSW99] (though it 
would suffice for our purposes to take Ci small). In any case, the proof in [CNSW99] immediately gives 
Proposition 113.31 as we have stated it. 

14 A general L^ theorem 

In this section, we state and prove a general L^ theorem, which we will use to complete the proof of 
Theorem 17.21 The goal is to develop a theorem which will imply 

{T;nnT,f ^^ ^ , {T,nTuT;f ^^ ^ <2-^l^-'=l, (14.1) 



L'^^L'^ 



L^^L^ 



for some e > 0, where Tj and Tk are as in Section [TUl We state this result in slightly greater generality 
than we presently need, as the more general result will play an essential role in [SSllai ISSllb) : in 
addition, this more general version only requires a small amount of extra work. Throughout this section 
we take Kq (e 51' d fl" (s 51, as in Section [HI a > will be a small number to be chosen later, depending 
on the particular operators we study. We will see, though, that a > may be chosen independent of 
j,k in (pXTj) . 

The setting is as follows. We are given operators 5*1, . . . , 5l, Ri and R2, and a real number C G (0, 1]. 
We will present conditions on these operators such that there exists e > with, 

||5i ■ ■ ■ Sl{Ri - R2)\\l2^l2 < C- 

In Section [15] we will show that the assumptions of this section hold uniformly when applied to (|14.1I) , 
in an appropriate sense, and this will allow us to establish (|14.ip . and complete the proof of Theorem 

o 

To describe our assumptions we suppose we are given C'°° vector fields Zi, . . . , Zq on Q with single- 
parameter formal degrees di, . . . , dglfj We assume that (Z,d] satisfies all of the assumptions of Theorem 
111.11 uniformly for xq G Kq, for some fixed ^ > 0. Furthermore, we assume that r of the vector fields, 
Zi,. . . ,Zr, M-generate Zi,. . . ,Zq, for some M > (see Definition 112. 8p . 

We now turn to defining the operators Sj. We assume, for each j, we are given a C°° function 
7j : i?^J (p) X 51" — !■ SI satisfying jj {Q,x) = x. We assume that each jj is controlled at the unit scale 

by (^, djcfl As usual, we restrict our attention to p > small, so that 7"^^ makes sense wherever 

we use it. We suppose we are given '0i,iiV'j,2 G C^ (M") supported on the interior of Kq and kj G 

C°° is^i (a) X fy] . Finally, we suppose we are given ^j G C^ {B^^ (a)). We define. 



S'jf [x) = V-jM {x) / / {l],t [x)) il::ja {lj,t [x)) Hj (t, x) ^j [t) dt. 



■^^The single-parameter degrees do not play an essential role in this section, and are only present to facilitate our 
applications of the results in this section. 

^*By this we mean that they are controlled by {Z,d\ at the unit scale near xq for every xq g Kq, uniformly in xq. 
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Note that, under the above assumptions, 

\\SJh^^L^,\\SJh^^L^<l; (14.2) 

and furthermore S* is of the same form as Sj with 7^-4 replaced by 7"^^ (c.f. Section ri2. 31) . 

Definition 14.1. If Sj is of the above form, we say that Sj is controlled by iz,d] at the unit scale. 

Remark 14.2. Since jj.t is controlled at the unit scale if and only if 7^/^ is (Proposition 112. 6|) 5"^ is 
controlled at the unit scale if and only if S* is. Furthermore, if jj-^.ti and 7^2. t2 ^'"^ controlled at the 
unit scale, then so is "fj-i,ti °lj2,t2 (^Proposition 1 1 2 . 6|) . and therefore, if Sj-^ and Sj^ are controlled at the 
unit scale, then so is Sj^Sj^. 

We assume, further, that for each Z, 1 < ^ < r, there is a j (1 < j < L), and a multi-index a (with 
I a I < B, where B € N is some fixed constant which our results are allowed to depend on), such that, 



a\ ot 



d 
1 de 



e=l 



7j,et ° lj,t i^) ■ 



This concludes our assumptions on Si,. . . ,Sl- 
Remark 14.3. Note that 71, ... ,7/, were chosen so that 

7(ti....,tL.si,...,si,) (x) ^lL,t^ oiL-i,t^^, 0---011M °7mi °l2X °---°7r,s^ (x)- (14.3) 

would satisfy the hypotheses (and therefore the conclusions) of Theorem 112.111 and Corollary 112.121 

We now turn to the operators i?i and i?2. It is here where the number C plays a role. We assume 
we are given a C°° function ^t,s (with 70.0 (x) = x), which is controlled by iZ,d] at the unit scalei 
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7 (i, s, x) : B^ [p) X [-1, 1] X O" ^ f], 70,0 [x) = x. 

Remark 14.4. Here, we are thinking of {t,s) as playing the role of the t variable in the definition of 
control. 

We suppose we are given K (t, s, a;) G C°° [b^ {a) x [-1,1] x rj"V ^{t) G L^ {B^ {a)), and?Ai,?A2 6 

C^ (M") supported on the interior of Kq. We define, for S, G [—1, 1], 

i?«/ (x) = ^1 (a;) y / (7t,5 (x)) ^2 (7t,? (x)) ^ (i, C, a^) ?{t) dt. 

Note that we have, 

lli?^ll lli?^ll < 1 fl4 4) 

We set Ri = R'^ and R2 = R° . 

Theorem 14.5. /tt. the above setup, if a > is chosen sufficiently small, we have, 

\\Si---SL{Rl-R2)h2^L-<CC, 

for some e > 0. 

Remark 14.6. It is important that a, C, and e may be chosen independent of any relevant parameters. 
We say merely, that a, C, and e can be chosen to depend only on the norms of the various functions 
used to define Sj, Ri, and R2, on the parameters B and M, on L, on the various dimensions, on the 
parameters in the definition of control (when using Q2), and on anything that 2-admissible constants 
were allowed to depend on as in Section[TTJ The reader wishing to, should have no trouble keeping track 
of the various dependencies in our argument. 



^®In most of our applications, 7t,s will be of the form -fs-t, where 7t is controlled by (Z,d] at the unit scale and s ■ t 
scales each coordinate of t by a non- negative integer power of s. 
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The rest of this section is devoted to the proof of Theorem 114.51 The reader not immediately 
interested in the details may safely skip to Section [15] to see how Theorem 114.51 is used. 

Remark 14.7. Note that in the equivalence between Qi and Q2, we used that we were able to shrink 
the t variable. For instance, to define 6 in the definition of Q2, we restricted attention to \t\ < rj' , where 
77' w 1. Thus, we must be able to restrict attention to s < c where c > 1 when working with jt,s- 
Since, in the definition Ri and R2 we fix s to be C and 0, respectively, it suffices to note that we may 
restrict attention to C. < cin Theorem ll4.5l This is clear, since when 1 > C ^ c. Theorem 114.51 is trivial. 
Henceforth, we assume C is small enough that all our definitions make sense. 

Define S = Si ■ ■ ■ Sl and R = Ri — R2. In what follows, e > will be a positive number that may 
change from line to line. We wish to show, 

\\SRh.^i^.<C, 

and it suffices to show, 

\\R*S*SR\\^,^^,_<C- 
Using (|14.4p . we have ||i?*||^2_j.^2 ^ 1, and therefore it suffices to show, 

\\S*SR\\^,^^,<C- 
Continuing in this manner, it suffices to show, 

(S*Sf'" R < C, (14.5) 

for some m > 0. Since ||S'||^2^i2 , ||5'*|j^2^i2 < 1 by (|14.2p . it suffices to show, 

11(5*5)" i?||^2^^2<C; (14.6) 

where we have just taken m so large 2™ > n and applied (jl4.5p . 
Combining (|14.2[) and (|14.4p we see, 

||(5*5)"i?||^.^^. <1, 

and so interpolation shows that to prove (jl4.6p we need only show, 

\\{S*SrR\\^^^^^<C. (14.7) 

Let / be a bounded measurable function. Rephrasing (J14.7I) . we wish to show, 

\iS*SrRfixo)\<C\\f\\L^, (14.8) 

for every xo € Kq (where we have used that fact that S*g is supported in Kq for every g). We now fix 
xq and prove (|14.8p . All implicit constants in what follows can be chosen to be independent of xq G Kq- 
Define, jt by pTH)) . That is, 

7(ti....,ti.,si,...,si,) (x) =lL,tL °7L-i,ti-i o---o7i,ti °liX °%l2 °---°1l!sl i^)- 

Thus, 7 is controlled at the unit scale by ( Z, d) , and S*S is given by, 

S*Sf (x) = Vi {x) J f {it {x)) ^2 (7* (x)) K {t, x) <; it) dt, 

where <;^ G Cg (J3^ (a'))i a' > is a small number depending on a (from here on out, a' > will be a 
small number (depending on a > 0) that may change from line to line), N = X]j=i '^^j^ '* ^ C°° ^ and 
V'l, V'2 G C^ are supported in the interior of Kq. Define, for t = (ii, . . . , i„) {tj e B^ (a')), 

Tr {x) = 7t^ o 7t2 o ■ . . o 7t^ [x) . 
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So that, 

{S*Sr I {x) = V-i (x) / / (r, (x)) ^-2 (r. {x)) K (r, x) c (r) dr, 



where the various functions have changed but are of the same basic form as before. Thus, 
{S*Sr Rf {x) =Vi (x) J f (7t.c ° r, (x)) ^2 ilt.x o Tr (x)) K (t, C, T, x) ^ (t) ? (t) dr dt 

- -01 {x) / / (7t^o o r^ {x)) -02 (7t,o o r^ (x)) K {t, 0, r, x) <: (r) ?'(i) dr dt. 



Here, k and tpi,ip2 are C°°, with i/)i and ■02 supported on the interior of Kq. Now think of xq S -R'o as 
fixed. We wish to estabhsh (114. 8p . The dependence of k on xq is unimportant so we suppress it. Given 
a bounded measurable function /, we wish to study the integral, 

/(/)=/"/ {\^ o r, (xo)) « (i, C, r) <? (r) ?-(t) drdt- J f (7^,0 o T, (xo)) k (t, 0, r) <^ (r) ^(t) dr dt. 

Note that -01 (xq) / ( fip2 ) = (S'*S')" i?/ (xq). Theorem ll4.5l now follows immediately from the following 
proposition, 

Proposition 14.8. For a > sufficiently small, there exists e > and C such that, 

\I{f)\<CC sup |/(z)|; 

^e-S(z.i)(^o,5) 

where a, e, and C ma?/ onZ?/ depend on the parameters the constants of the same names were allowed to 
depend on in Theorem \14.5\ see Remark \14.6] 

We devote the remainder of this section to the proof of Proposition 114.81 First note that it suffices 
to consider only k of the form, 

K{t,C,T) = Kl (t) Ko (t, C) : 

since every k may be written as a rapidly converging sum of such terms. (This reduction is unnecessary, 
but simplifies notation in what follows.) 

Proposition 112.61 shows that [Z,d] controls 74^3 ° Tt at the unit scale near xq, if a > is sufficiently 

small (and therefore t, s and t are sufficiently small), then jt,s°^T (xo) € B/^ jn (xq, ^1) (see Q2). Hence, 

/ only depends on the values of / on Br^ ^\ (xo,Ci). We may, therefore, restrict our attention to / of 

the form g o $, where g is a function on B"" (?/i); as the result would follow by taking g = f o $^^. $ 
here is, as usual, the scaling map from Theorem lll.il and depends on the particular point xq. Note, in 
particular, $ (0) = xq. 
Define, 

dt = $"^0740$, 

e, = $-1 o r, o $ = ?t^ o ^t, o . . . o ^t„ , T = (ii, . . . , i„) , 

Ot,s ^ $"^o7t_, 0$, 
We see, 



(14.9) 



^ (ff ° *) - / 5 {Otx ° e, (0)) K„ {t, C) Ki (r) ? (r) f (t) dr dt 

9 {Ot,o o e, (0)) no (t, 0) Ki (r) c (r) ?{t) dr dt. 
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Note that, since kq is C°°, we have kq (t, C) = ^o {t,0) + O [C) uniformly in t. Combining this with 
(|14.9|) . it is easy to see, 



(14.10) 



I{go^)^ .g 0t,Coe,(O) -5 0t,ooe,(O) ^0 (t, 0) Ki (r) ^ (r) ?-(i) dr di 



Ok sup \g {v) 



Note that the error term in (114. lOp is of the desired form. Thus, let / (g) be the first term on the right 
hand side of (|14.10p . It suffices to bound I {g) . 

Note that, in light of Remark 114.31 Corollary 112.121 shows that 9 satisfies (Cj) uniformly in any 
relevant parameters, i.e., there exists a multi-index /3 (with \/3\ < 1) such that|fj 



Since, 



— det ^ T,0 

OT J noxno OT 



|0|lci^i+i^l' byQ2, 



> 1. 



we see that if we take a > sufficiently small. 



P 



— det ^(t,0) 



>1, 



for all T in the support of <;. 

Applying Proposition 113.21 with ^ (r) — 6t- (0) and "0 (r) = ki {t)(;{t), we see that there exists 
S>landheLl (M") (with \\h\\^, < 1) such thatEl 



(g) = [g {dtx (")) - 9 (^t.o ("))] h (u) K„ {t, 0) ?{t) du 



,dt. 



Applying two changes of variables, we have. 



^^^^C 



1(9) ^ g{v)i det -^ {v) h (fl-^i («)) K„ {t, 0) ?-(*) dv 



dt 



g (v) I det ^ (v) j h [e-^ («)) Ko (t, 0) ^(i) dv dt 



Using that 9 (t, s, u) is C°° (uniformly in any relevant parameters, by Q2), we have 

fdet^(.))^(det^(.))+0(C), 



and therefore. 



der' 



T{g) ^ g{v)i det -^ (v) (h (e;} («)) - h {e^l {v))) ko {t, 0) ?{t) dv 



dt 



+ O (C) sup \g {v) 
veB"a(n') 



^"Actually, when applied to 9, (Cj) only requires we take a composition of no terms, instead of the n terms used to 
define 0. Since n > no the result still holds. 

^'^Of course, h is supported on the range O^ (0), and therefore supported in B"" {r]'). 
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Using J \ko (i, 0) <j (t)| < 1, we have, 



1(9) 



< 



sup 

\t\<a 



B"0(,7') 



{OT.M («)) - f' {(^2i iv)) 



dv 



C > sup \g (v) 

I ■ueS"0(r)') 



The proof wiU now be completed by showing, for every |i| < a (think of t as fixed), 

J\h{e-'^iv))-h(^e^^^{v))\ dv<c, 

for some e > 0. Define a measure S by, 

/ k (y, z) dE (y, z) = / A: (^"i (v) , ^'qI («)) 

so that the left hand side of (|14.11l) becomes, 

\h{y)-h[z)\dE[y,z). 



(14.11) 



dv\ 



Since Q^ } depends smoothly on (, 



't.X 



^rc(^)-^*:oW 



< 



c- 



Hence :^ is supported on those (y, z) such that \y — z\ < (^. Applying Proposition I13.3"l we see, 

\h{y)~h{z)\dE{y,z)<C, 
for some €>1. This establishes (114. lip and completes the proof. 

15 Completion of the proof 



In this section, we complete the proof of Theorem 17.21 using Theorem 114.51 We use the same notation 
as in Section [TOl and we therefore take Tj (for j G — logj A) as given by (jlO.ip . The goal is to show 
that the operator, 

converges in the strong operator topology as bounded operators on L^. This follows from the Cotlar- 
Stein lemma and the following result. 



Proposition 15.1. 



for some e > 0. 



||T,!T,|l^.^^.,||T,r,*|j^.^^.< 2-1^-^-1 



This section is devoted to the proof of Proposition 115.11 The proof is essentially the same for the 
two terms, and so we only exhibit the proof for \\T^Tj\\j^2_).]^2- Throughout, e > may change from line 
to line, but will always be independent of j, k. We will show. 



[T;nTtT,] 



< 2-^IJ~'=l 



L^^L^ 



(15.1) 



and the proof will be complete (with e replaced by e/4). 

Remark 15.2. In Section[TOl instead of squaring T*TkT^Tj, we raised it to the n+1 power. This will be 
implicit in this section, via our application of Theorem 114.51 Actually, the proof of Theorem 114.51 will 
(essentially) raise T*TkTj^Tj to a power larger than n+1. However, the proof could be easily modified 
(in this special case) to raise it only to the n + 1 power. This is not important for our purposes, though. 
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We set ja — j A k E — log2 A. Let {X, d) be the list of vector fields from Section [6l and define 
iz,d] — (2^^°X,'^d). We will be using this (Z.d) in our application of Theorem 114.51 Take /io so 
that, 

b'/jo ^ ^A'o I — li ~ '''loo • 

We assume first j^i„ > fc^^ and remark on changes necessary to deal with the reverse situation at the 
end of the section. We assume j^^ > fc^g as otherwise |j — fc| =0 and (115. 1|1 is trivial (it is easy to see 
that||T,H^,^^^,,||Tfc||^.^^. <1). 

First, we introduce the operators Si which we will use in our application of Theorem 114.51 We set, 

S*! • ■ • 57 = T*TkT^TjT*TkT^. 

Thus our goal is to show, 

First we claim that each Sj is controlled at the unit scale by (Z,dj (see Definition 114. II) . We prove just 
the result for Tj , the same proof works for Tk , and the claim follows since Si is controlled at the unit 
scale if and only if 5*;* is (see Remark ll4.2p . By a simple change of variables in (|10.ip . we obtain, 

T,f (x) = Vi (x) J f {j2-H (x)) iJ2 (72-.t (x)) K {2-H, x) ^j (t) dt. (15.2) 

Proposition 112.51 shows that (2^-"'X, ^d) controls 72-Jot at the unit scale. Since jo < j coordinatewise. 
Proposition 112.71 shows that (2^^''X, ^d) controls 72-Jt at the unit scale. That Tj is controlled at the 
unit scale by (2~-"'X, ^d) now follows immediately. 

We now turn to the other part of our assumption on the Si : that there are vector fields Zi , . . . , Z^ 
which generate Zi, . . . , Zq that appear at the Taylor coefficients of 



d_ 



e = l 



where 7; is the function defining Si, for some I. For this purpose, we need only use S2 — Tk and 6*4 — Tj. 
In light of (jl5.2p . we see 72.4 — 72-fct, 74. t — l2-it- Now the result follows from Lemma [12. 141 where we 
have taken 2^-'" = (5o, 2~-' — 5i, and 2^^ — 82- Thus, 5i, . . . , 6*7 satisfy all the hypotheses of Theorem 

ME 

Before we define i?i and R2, we need to make some preliminary remarks. First, for the reader only 
interested in the class of kernels /C, decompose t G K^ as i = (^1,^2), where ii = t^ (as defined in 
Section ing . Note, by the definition of f^, when t is scaled (i.e., when we consider 2^^^°t) all of the 
coordinates of ii are scaled by (at least) a factor of 2'^v^''o~ '^oj — 2^l^^'"'l°o, where c is some constant 
depending on the dilations e. The reader uninterested in K, may skip the rest of this paragraph. Since 
Jiio > ^Mo' i^ follows that /io is not j, ^-minimal (see Section |4?2)) . Letting C > 1 be as in Definition 14.41 
take /ii such that /io :^j,c,A fJ-i and there does not exist fi2 with /ii ~<j^c,A A'2- By definition, for every 
/i e [/ii]j c^' ^^ iiUMB, 

j^-KZ Jmo - fc^o > b' - fcloo ^ IJ - fc| ■ (15-3) 

For t G M^, decompose t = (ti,t2), where h = ^-'^'-^^•^•^ and tj = 4^''''^"^ (as in Section H^]) . By the 
definition of ti, when t is scaled (i.e., when we consider 2^~^°t) all of the coordinates of ii are scaled by 
(at least) a factor of 2"^ Ut^-k^) Jqj. gome /i G [/ii],(7^ (where c' depends on the dilations e). By (|15.3p . 

each coordinate of ti is scaled by at least a factor of 2''(^''o~''^o) = 2''l^~''l~. 

Note by the definition of t = (ii, ^2) in the previous paragraph, we have (by our assumptions on K), 

•ij (<) dti = 0. 
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In addition, we have from the discussion in the previous paragraph, if \2^ ^°t\ < 1, then 2'^" '''<^ \ti\ 

Ihl < a. I.e., (;f "'•' (i) is supported on 2=l-''~'''l~ \ti\ + jtzl < a. Letting ( = 2-"^^-''^^, we see that ■ 
may rewrite, 



(23 -jo) 



dC'h 



where C~^^i given by the usual muhiphcation, -^ — - is the Radon-Nikodym derivative of this muhiph- 
cation (and therefore is equal to C^™ where m is the dimension of the ti space), and ^ E L^ (S^ (o))? 
uniformly in any relevant parameters. 
Now consider, 

Tjf{x) =Vi (x) f f{l2-^o(tut2) (2^)) V'2 (72-.o(ti,t,) {x))n{2~^" (ti,i2),a;)4'' '"^ (^1,^2) dh dh 



V'i(x) 



f {l2-^0(t,M) (2;)) ^-2 {l2-H>(tut2) (2;))k(2 ^° {tiM).x) 



- f (72-Jo(o,t2) i^)) "^2 (72-ra(o,t2) C^^)) '« (^ ''" (0, ^2) , a:) 



(23-Jo) 



(tl,i2) rfil ^^2 



4>i (x) 



/(72-Jo(cti,i2) (2;)) ^-2 (72-™(cti,t2) (2;))«^(2 ^° (Cii,^2),a;) 



/ (72-Jo(oti,t2) i^)) ^2 (72-^0(0*1, i2) (2;)) «^ (2 ^" {Oti,t2),x) 



?(ii,i2) rf^i dt2. 



This shows that Tj is of the formi?i— i?2; indeed, we merely take 74^^42 ^^ = 72-ra(sti,t2) ^'^'^ ^ (^ii^2, s,x) = 
K (2^-"^ (5^1,^2) ix). It is easy to verify that these choices satisfy the relevant hypothesesCJ and we leave 
the details to the interested reader. 
Theorem ll4.5l applies to show, 

||5i • • • 57 (i?i - R2)\\l-^^l- ^ C~' = 2-^'l^'-'^-l» < 2-^"l^-'=l, 



for some e, e', e" > 0. This verifies (|15.1I) and completes the proof, in the case when j^^ > fc^j^. 
When kf^g > j^^ the proof is similar. To bound. 



{T;TunTj] 



L^^L^ 



we use the fact that ||2j||^2^^2 , ||T'fe||i2_j^2 < 1, and instead bound. 

We take, ^i • • • 5*5 = T*TkT^TjT* and we define Ri - R2 = Tk in exactly the same way as we did for 
Tj above. The rest of the proof follows in exactly the same manner as above. This completes the proof 
of Proposition [153] and therefore of Theorem 17.21 

16 Kernels revisited 

In this section, we further discuss the class of kernels K, {N, e, a. A) defined in Section U) None of the 
results here are used elsewhere in the paper, but we hope they will give the reader a better understanding 
of the class of kernels we use. 

Lemma 16.1. Every sum of the form (14.41) converges in the sense of distributions. 



^It is immediate to verify that 72-Jo(st t ) satisfies Q2, since 72-Jo(t t ) does. 
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,N\ 



It suffices to show 



Proof. Let C, A, a, and {(^j} be as in Definition 14.41 Fix f E Cq 

for some e > 0. Fix k = {ki, . . . ,ki,) E A. Note tliat tliere is an eo > sufficiently small, such that for 
all but finitely many j — (j'l, . . . ,j^) G — logj A, there is a /iq = Mo (j) such that 



\j -k\< coOmo -^mo) 



(16.1) 



We ignore those finitely many j such that p6.ip does not hold, and we also assume j ^ k. Fixing j and 
Ho such that (|16.ip holds, let /ii be such that /xq :<j^c,A Mi and such that there does not exist /i with 
/ii -<jfi^A M- Note that there exists ei > (independent of j) such that 

< Ij - A:| < eo (j^o - fc^o) < ei Om ~ ^m) ' foi' every /i e [mi]j,c,^ ■ (16-2) 

In fact, ei = 4^ will do. In light of (|16.2p . we see that /ii is not j, ^-minimal (since /ii £ [/ii] 
Thus, we have, 

%f ) (t) <'^-- = 0. 
Furthermore, on the support of (jv / (i), there is a /i G [mi] ^ ^ and 62, £3 > such that 



j.c.aJ 



[mi]j,, 



< a2-''^^^'^-'''''> < a2-''^^^-''^ < 2""^l^'l, 



where in the last inequality we have used that k is fixed. Thus, we see 



('') (t) f (t) dt 



(2 



'Ht)[f{t'l 






/(o,i 



2 



di 






di 



completing the proof. 

Next we turn to investigating the case when A = [0, 1]'^ . In this case, we have the following: 



D 



Lemma 16.2. Taking the same notation as in Definition \4.4\ with A = [0, 1] , the cancellation condition 
3)) holds if and only if 



(;j {t) dt'^^ = 0, unless j^ — 0, 
where i^ is as in the start of Section \4-l\ 



(16.3) 



Proof. This follows from a straightforward application of the definitions. Indeed, in this case — log2 A = 
N'^. The result follows from noting that if j — {ji, . . . ,j,y) € N" , then fj, is j, [0, l]''-minimal if and only 
if jp = 0. Furthermore, if j^ 7^ 0, then it is easy to see that there is no m' 7^ /i such that /i :^j^c,A m'j 
for any C > 1. ' ' D 

Proposition 16.3. So e JC {N, e, a, [0, l]""). 

Proof. Fix 77 e C^ {B^ (a')) with J rj = 1, where a' > is a small number to be chosen in a moment. 
For j eW, define <;j by 



_(2^) 



^ (_i)5:.^.^(2-''). 



(16.4) 



P6{0,1}" 
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It is clear that, for a' > sufficiently small, {q} C C^ [B^ (a)) is a bounded set. 
Fix j S N" and /i such that j^ / 0. We wish to show 

'ij (t) dt^ = 0, 

and to do so, it suffices to show 

(^f'''(i) di^=0. (16.5) 

Consider the sum (|16.4p . For each choice of ^' ^ ^ and p^' G {0, 1} (with j^/ — p^/ > 0), the sum 
(116. 4p contains two terms: one for p^ = 1 and one for p^ = 0. These two terms have opposite signs, and 
therefore the sum of their integrals against df^ is 0. (|16.5p follows. 
To complete the proof, we must show 



E4^^-^o. 



In light of Lemma ri6.1[ it suffices to show, for each to > 0, 



y: ,f)=^(^"-.^"), (16.6) 

since 77'^ '■'^ ^ — > (5o in distribution as m —> 00. 

To see ()16.6p . fix j G N'^ with at least one coordinate less than to and the rest less than or equal 
to TO. Let z/Q denote the number of coordinates of j that are equal to to,. It is easy to see that the 
coefficient of t;^^ ) in the sum (|16.6p is equal to 

I/— I/O / \ 

whereas the coefficient of t]^^"'"'^"'-' is 1. This completes the proof of (116. 6p and therefore the proposi- 
tion, n 

Lemma ri6.2l will allow us to see product kernels as the special case of /C {N, e, a, [0, 1]"^) when each Cj 
is nonzero in precisely one coordinate. We begin by introducing the notion of a product kernel. Our main 
reference is [NRSOlj and we refer the reader there for more details. The definition for product kernels 
is recursive on the number of products, v, with 1^ = 1 corresponding to the classical Calderon-Zygmund 
kernels. To define product kernels, suppose we are given a decomposition of M.^ , M.^ = R^^ x • • • x M.^'^ , 
into ly homogeneous subspaces with given single-parameter dilations on each subspace. That is, for 
each /i we are given e^, . . . ,e'^ S (0, cxd). For S > and t — (ti, . . . , tN^) S R^f , we define St = 

(S'^^ti, . . . , 5'^^'^t^ J . This, therefore, defines v parameter dilations on M^: \i 6 — ((5i, . . . , 6^) & [0, 00)'^ 
and i = (i\ . . . , t'') e R^i X • • • X M^- = M^, we define 

St^ {6it\...,da'')- (16.7) 

Definition 16.4 (See Definition 2.1.1 of |NRS01) ). A product kernel on M^ relative to the above 
decomposition is a distribution K on R^ which coincides with a C°° function away from the coordinate 
subspaces t^ — and which satisfies: 

1. (Differential inequalities) For each multi-index a — (ai, . . . , a^)l^ there is a constant Cq so that 

|af,^ ■ • ■ dpK {t)\ < c„ \ty^i'+"iy^ . . . \t-\-U^+<y. , 

away from the coordinate subspaces. 



*^We write a,, = I a},, . . . ,a,,'' 
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2. (Cancellation conditions) These are defined recursively on v, the case v = \ corresponds to the 
usual Calderon-Zygmund operators. 

(a) For V — 1, given any C°° function (j) supported on the unit ball with C^-norm bounded by 1, 
and any _R > 0, 

K (t) 4) [Rt) dt 



is bounded independently of and R. 

(b) For V > 1, given any /i, 1 < /i < i', any C°° function supported on the unit ball of R^f" 
with C^ norm bounded by 1, and any /? > 0, the distribution 

K^^R{t',...,t^-\t^+\...X)= f K{t)4{m^) dt^ 

is a product kernel on the lower dimensional space which is a product of the R f' for fj! ^ n, 
uniformly in (p and R. 

Let e denote the multi-parameter dilations induced by (J16.7I) . Then we have the following result: 

Proposition 16.5. Suppose K is a distribution supported in B^ (a). Then, K is a product kernel as 
in Definition \16.4\ if and only if K (z JC (iV, e, a, [0, 1] ). 

Proof This follows easily from Corollary 2.2.2 of jNRSOlj . D 

We now turn to the notion of flag kernels. Flag kernels provide an easy to understand situation 
where one uses a choice of A other than [0,1]'^. For flag kernels, we take the same setup as above 
(flag kernels are special cases of product kernels). Here we think of the factors M^^ x • • • x R^" as 
being ordered (for the product kernels, the order of the factors did not matter). Again, we write 



X • • • X R^- 



t=(i\...,r)e 

Definition 16.6 (See Definition 2.3.2 of |NRS01] ). A flag kernel is a distribution K on R^ which 
coincides with a C°° function away from f^ = and satisfies 

1. (Differential inequalities) For each multi-index a = (ai, . . . , a,, r^l there is a constant Cq, such that 
for i" 7^ 0, we have 

|9,T • ■ • dpK{t)\ <c„ (|ti| + • • • + |t-|)-^(i+"i)4 

•■• (|t''-l| + \t-\)-^('+<-^>r' |i.|-E(l+ai)eJ 

2. (Cancellation conditions) These are defined recursively on v. 

(a) For u = 1, given any C°° function (p supported on the unit ball, with C^ norm bounded by 
1, and any i? > 0, 

K (t) (t> {Rt) dt 



is bounded uniformly for <j) and R. 

(b) For V > 1^ given any n, 1 < jJ. < i^, any C°° function cj) supported on the unit ball of R^*" 
with C^ norm bounded by 1, and any R > 0, the distribution 

K^^n (t\ . . . , t^-\ t^+\ ...X)^ I K{t)4 {Rt^^) dt^^ 



is a fiag kernel on R^i x • • • x R^f-i x R^"+i x • • • x R^", uniformly in and R. 
We state the following result without proof. It is a result of Nagel, Ricci, Stein, and Wainger. 



** Again, we write a^ = ( a]^ , ■ ■ ■ , ot^ '' 
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Proposition 16.7. Let A = {{Si, . . . , 5^) ^ [0,1]'^ : Si > S2 > ■ ■ ■ > 61,}. Suppose K is a distribution 
supported on B^ (a). Then K is a flag kernel as above if and only if K is an element of IC {N,e,a, A). 
Here, e are the multi-parameter dilations defined earlier. 

We now present an interesting example of kernels, which are neither product kernels nor flag kernels, 
but are still of the form IC {N, e, a, [0, 1]''). We work in the case A^ = 3 and we think of M.^ as being iden- 
tified with the three-dimensional Heisenberg group, H^ (though our example can be easily generalized 
to any stratified Lie group). If we write (x, y, t) €M.^ = H^ as coordinates, then the group law of H^ is 
given by 

{x, y, t) (x', 2/', t') ^{x + x',y + y',t + t' + 2 {yx' - xy')) . (16.8) 

We take v = 2 and consider the two parameter dilations given by ei = (1,0), 62 = (0,1), and 63 — 
(1, 1)Cj That is, we take the two parameter dilations given by, 

{6i ,62)ix,y,t)^ {Six, S2y, SiS2t) . (16.9) 

It is easy to see that these dilations are automorphisms of the Heisenberg group. It follows from our 
theory that if X e /C ( 3, e, a, [0, 1] ) , then the operator given by / i-^- / * X is bounded on L^ where the 

convolution is taken in the sense of H^ . We will see in Section 117.81 though, that when the convolution 
is taken in the sense of the usual Euclidean structure on R^, then the operator may not be bounded on 

17 Examples 

In this section, we present a number of examples to help elucidate the various aspects of Theorem 17.21 

17.1 Exponentials of vector fields 

Let A C [0,1]"^, Kq (e n, and e be as in the statement of Theorem 17.21 For each multi-index, a, 
< |a| < M, let Xa be a C°° vector field on il. Define a function, 

7t (x) = e^o<i<.i<M*°'^°a;, (17,1) 

We say 7 satisfies (V), if the following conditions are satisfied: 

• {{Xa,deg{a)) : deg (a) is nonzero in precisely one component} generates a finite list (X,d). See 
Definition 15.31 



• The above obtained list {X,d) controls {Xa,deg{a)) for every a with deg (a) nonzero in more 
than one component (and thus for every a). 



Proposition 17.1. 7 satisfies (V) if and only if ^ satisfies the assumptions of Theorem \7.2\ 

Proof. This is a combination of Propositions 112.161 and 111.121 D 

17.2 Nilpotent Lie groups 

In this example, we investigate a special case of the previous example. In this example we take A = 
[0, lY , and take Kq (e fl and e as in the previous example. Recall, we call a a pure power if deg (a) 
is nonzero in precisely one component. Fix M large and for each a, < \a\ < M, which is a pure 
power, let Xa be a C°° vector field. For the non-pure powers, let Xa — 0. Assume furthermore, that 
the iterated commutators of the Xa are eventually 0. Then, if 7 is given by (jl7.ip . 7 trivially satisfies 
(V) and therefore satisfies the assumptions of Theorem 17.21 



*^The resulting kernels are not product kernels, since 63 is nonzero in more than one component. These dilations are 
sometimes referred to as Zygmund dilations-see IFP97I . 
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To give a concrete example of the above, let Xi, . . . ,Xi be left invariant vector fields on a nilpotent 
Lie group. Consider the operator given by, 

f^ij{x)ff (e*i^i+*^^^+-+*'^'x) K {ti, . . . ,ti) dt, (f 7.2) 



where K {ti, . . . ,ti) is a product kernel relative to the decomposition ofM'=MxMx---xK (and K 
is supported sufficiently close to 0). Then, ii ip ^ C^ has small support, the operator given by (jf7.2p is 
automatically bounded on L^. 

Furthermore, by a simple scaling argument, if the Lie group has an appropriate family of dilations 
under which the vector fields are homogeneous (for instance if it is a stratified Lie group), then one may 
replace ip with 1 on the right hand side of (|f7.2[) and remove the restriction that K have small (or even 
compact) supports 

Note that we have not involved terms like tit2X in the exponential. These would not be pure powers, 
and so (V) would not hold vacuously: one would need an additional condition on the vector fields to 
ensure that (V) holds. In Section Fl7.5l we will discuss a translation invariant operator in Euclidean space 
of the form (|17.2|) . except with non-pure powers as well, where (V) does not hold, and moreover which 
is not bounded on L^. 

Remark 17.2. We will see in [SSI lb] that the vector fields Xi,. . . ,Xi can be replaced with any real 
analytic vector fields; not just left invariant vector fields on a nilpotent Lie group. In particular, they 
could be replaced by left invariant vector fields on any Lie group. 

17.3 The L^ result of Christ, Nagel, Stein, and Wainger 

In this section, we show that the L^ result of [CNSWOO' (the p = 2 part of Theorem 19. 2p is a special 
case of the main result of this paper (Theorem I7.2p rl Thus, we are considering the single-parameter 
case V — 1. We take A = [0, 1] and fix single parameter dilations e = (ei, . . . , cat), Cj S (0, oo), and 
Kq g fi as in Theorem 17.21 In [CNS W99] . Cj is taken to be 1 for every j, but this is not essential to 
their work. 

The condition assumed in JCNSW99] on 7 was that if X^ was given as in (|6.ip . then {Xa] satisfies 
Hormander's condition at every point x 6 Kq. See {Cz) in Section[9l 

Proposition 17.3. Under the above hypothesis, {Cz), 7 satisfies the hypotheses of Theorem \7.2\ 

Proof. Let {{Xi,di) , . . . ,{Xr,dr)} be a finite subset of {{Xa,deg{a))} such that Xi,...,Xr satisfy 
Hormander's condition. Assume that Xi, . . . ,Xr satisfy Hormander's condition of order m. That is, 
Xi, . . . , Xr along with their commutators up to order m span the tangent space at each point of Kq. 

We recursively define single-parameter formal degrees on the commutators of {Xa,deg{a)) in the 
usual way: if Y has formal degree di and Z has formal degree d2 , we assign to [Y, Z] the formal degree 
di + d2. Enumerate the list of all commutators of Xi , . . . , Xr up to order m along with their above 
defined formal degrees, 

(Xi,di),...,(Xi,dL); 

so that Xi, . . . , Xl span the tangent space at each point. Let (^1, di) , . . . , {Xq, dq) be an enumeration 
of all the vector fields that can be written as a commutator of the Xa and such that their above defined 
formal degree is less than or equal to maxi<;<L d/Cj It is easy to see that there are only a finite number 
of such vector fields. Note, we are using in an essential way that the degrees are numbers, unlike in the 
multi-parameter case, where they are vectors. 
We claim that, for 1 < i,j < q, 

[X,,X,]^ J2 cl^Xk, cl^^C^. (17.3) 



*®By this we mean we can take any product kernel K as in Definition 116.41 without restricting our attention to those 
kernels with small support. This is the same as considering kernels in K {N, e, a, [0, 00)") where e is chosen as in Proposition 
I16.5l and a > is any real number. In this case, IC is the same as K. 

^'^ Similar remarks hold for every 1 < p < 00, and this will be shown in ISSllal . 

^*Wc maintain the notation that (Xi, di) , . . . , (X^, dj^) are the first L of these vector fields with formal degrees. 
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Indeed, if di + dj < niaxi<;<L di, then {[Xi, Xj] , di + dj) is already in the list {Xi, di) , . . . , (Xq, dq) by 
definition. On the other hand, if di + dj > maxi</<L d/, we use the fact that, 



i^^^^j] =^cljXk 



k=l 

since Xi, . . . , Xl span the tangent space at each point. It was shown in Section [3] that the list {X, d) — 
{Xi,di),...,{Xq,dq) satisfies V {Kq, [0, 1]) if it satisfies (pTSj) . 

Hence, to complete the proof of the proposition, we need only show that the list {X,d) controls 7. 
Let W {t, x) be as in Definition l6.3[ so that the vector fields Xa (x) are the Taylor coefficients of W {t, x), 
when the Taylor series is taken in the t variable. Then we have, 

Wi6t,x)= J2 t°'S'^'"^'^'"'^X^ + o(\Stf'y (17.4) 

q|<J\-/ 

Note, each term, 

^dcg(a)^^ = J2 ciXk, 4 e C°° uniformly in 5. (17.5) 

fc=i 
Indeed, if deg (a) < maxi<;<L di, then {Xa, deg (a)) appears in the list {Xi, di) , . . . , (Xq, dq), by def- 
inition. On the other hand, if deg (a) > maxi<;<L di, then we use the fact that Xi, . . . , X^ span the 
tangent space, and write, 

L 

Xa=Y.ciXi, ci^C^. (17.6) 

Multiplying pT^ by ,5'^°s(a) immediately yields ([T731) . 

Thus, each term of the sum on the right hand side of (117. 4p has the desired form for the definition 
of control. To complete the proof, we need only show that the term O f |^i| 1 has the desired form, 
provided M is sufficiently large. Take M so large that, 

M min e, > max di. 

l<j<N ^ 1<1<L 

Then, 

Using one last time that Xi, . . . , Xl span the tangent space, we see that we may write the O I |(5t| ) 
term in the form, 

L L 

J2 4 {t,x) <5'°'^'^i<'<^ 'i'Xk ^J2^i^'^'^k, 

fc=i fc=i 

with 4 £ C°° uniformly in S. This completes the proof that 7 is controlled by {X, d), and therefore the 
proof of the proposition. D 

Remark 17 A. As remarked in Section [31 even in the single-parameter case. Theorem 1 7. 2 1 is more general 
than the L^ result from ^CNSW99^ . Indeed, the result in this paper also applies in some situations when 
7 lies in the leaves of a (possibly singular) foliation. To apply the methods of [CNSW99] directly to a 
foliated manifold, one would need the foliation to be non-singular. See Section [3] for more details. 

17.4 How the assumptions can fail 

It is perhaps instructive to understand examples of 7 where our assumptions do not hold. In this 
section, we exhibit 7 which exemplify the various ways the assumptions can fail. We make no claim 
about whether or not the associated operators are bounded on L^, and only claim that our theorem 
does not apply. 

In the single parameter case {v — 1 , A — [Q ,1]) , there are three ways in which 7 might fail to satisfy 
our assumptions. 

53 



1. The vector fields Xa (see (j6.1|) ) might fail to generate a finite list. In the single-parameter case, this 
is essentially equivalent to failing to generate a locally finitely generated involutive distribution (see 
LemmaE^. For instance, let Xi = dx, X2 = e~^dy, then the function 7 (i, {x, y)) : R x M^ — )- R 
associatecO to the vector field W given by 



W{t,(x,y))=tX,+fX2, 



has this property. 



2. If the Xa generate a locally finitely generated involutive distribution, this distribution foliates the 
ambient space into leaves. Our assumptions require that 74 (x) lies in the leaf of this foliation 
passing through x for every t. This is not always the case. For instance, 7t (x) : M x M — > R given 
by, 

7 (i,a;) : a; H> X — e~t^, 

does not lie in the appropriate leaf. In this case, all the Xa are (and so each leaf is merely a 
point). 

3. Even if the Xa generate a locally finitely generated involutive distribution, and 74 (x) lies in the 
appropriate leaf, it may still be that 7 is not controlled by the list {X, d) . We work on R. Define 
the vector field W {t, x) by, 

W{t,x) ^te^^dx + e'^xdx. 
Let 7i be function associated to this W (via Proposition 112. ip . Note that, 

{negative, if x is negative, 
zero, if x is zero, 

positive, if x is positive. 

In this case, there is only 1 Xa, namely Xi — e~^dx- Thus, the leaves of the foliation are 

(-00,0), {0}, (0,00). 

Hence, the proof will be complete if we can show that 7 is not controlled by ( e^^c^a,, 1 ]. If 7 

were controlled, it would imply, in particular, that there exists a ip 7^ such that for every x near 

0, we have, 

_ 1 J 

e *o x = c (x) e~^, 
with c (x) bounded uniformly as x — > 0. This is clearly impossible. 

Note that in each of the above examples, we used functions that vanished to infinite order at a point. 
One might wonder if such a phenomenon is essential in creating an example where our assumptions fail 
in the single parameter case. Indeed, it is essential. It turns out that, when restricting attention to real 
analytic 7, our assumptions are automatically satisfied in the single parameter case. This is taken up 
in |SSllb| . 

We now turn to our multi-parameter assumptions. Here, there is a new phenomenon which appears, 
even when considering real analytic 7: one may have that the vector fields corresponding to the non-pure 
powers are not controlled by the pure powers. For instance, consider the operator on R given by, 

f ^tP{x) f f{x-st) K{s,t) dsdt, (17.7) 

where K is a. product kernel relative to the decomposition (s, i) G M x R (and K is supported near 
0). Here, there is only one vector field d^ given degree (1, 1) (we are using the standard two-parameter 
dilations on R x R). Thus, this example does not satisfy our assumptions. In fact, the methods in 
Section 117.51 can be used to show that there is a i^ such that (|17.7p is not bounded on L^ . 



^®See Proposition 112. n for the bijective correspondence between 7 and the vector field W. 
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17.5 A translation invariant example 

In this section, we investigate some translation invariant operators on M, in the two-parameter setting. 
In particular, we will investigate operators on L^ (M) given by, 

TkJ {x) :^ J f{x-p (s, t)) K (s, t) ds dt, (17.8) 

where p is a polynomial without a constant term and K is a. product kernel on R^. That is we are taking 
the two-parameter dilations e — ((1, 0) , (0, 1)), so that (61,62) (s, t) = {6is, 62t) and we are considering 

isT G /C ( 2, e, ao, [0, 1] 1 for some small oq > 0. 

Remark 17.5. Note we have not included a cutoff function in (|17.8p . Since K has small support, the 
value of TkJ [x) only depends on the values of / near x. It is not hard to see, from this and the 
translation invariance of Tk, that Tk is bounded on L^ if and only if ijjTk is bounded on L^ for some 
C^ cut-off function ^ with 7/; = 1 on a neighborhood of 0. 

Remark 17.6. We have restricted to one-dimensional, two-parameter operators for simplicity. The same 
methods in this section apply to higher dimensional higher parameter operators. 

Remark 17.7. Results like the ones in this section date back to |NW77| . Thus, the results in this section 
are partially expository, though our perspective is somewhat different. 

Write 

V{s,t) = ^ Ca{s,t)" . 
q|>1 

Let a > be the lowest a such that C(a,o) 7^ 0, and b be the lowest h such that C(o,6) 7^ 0. For simplicity, 
we assume a, & < 00, but this it not necessary for what follows. 

Theorem 17.8. Tk is hounded on L? for every K with sufficiently small support if and only if whenever 
Cfe.f) 7^ 0, we have that (e, /) lies on or above the line passing through (a, 0) and (0,6). 

Proof. The if direction follows from Proposition [T7TT1 Indeed, if we set 7(s_t) {x) = x — p{s,t), then, 

7(.,t) [x) = e-P(^'*)^==:r. 

To see that Proposition 117. II applies, one merely needs to note that [dx, (a, 0)) and {d^, (0,&)) control 
{dx, (e, /)) if and only if (e, /) lies on or above the line passing through (a, 0) and (0, h). This is easy to 
see directly, and is also covered in Example 5.16 of |Strllj . 

For the only if direction, we will show if there exists C(ej) ^ and ^e -I- -^J < 1, then there exists a 
product kernel K with small support such that Tk is unbounded on L^. Let e, / be such that C(^ej) 7^ 
and -e + -r/ is minimal among all such (e, /). Furthermore, we assume that e is minimal among these 
choices. Note that, by assumption, 7^ e, /. 

First we wish to reduce the question. Note that product kernels are a Frechet space (see, for instance. 
Definition 116.41) . Hence, by closed graph theorem, if Tk were bounded for each choice of K, then the 
map K i^ T would be continuous as a map from the space of product kernels to the space of bounded 
operators on L^ (given the uniform operator topology). Hence, to prove the theorem, it suffices to 
construct a sequence {Kj} of product kernels such that, 

• {Kj} is a bounded set in the space of product kernels. 

• The Kj all have arbitrarily small support. 

• II^^J L2^L2 ^ 00 as j ^- 00. 

Take i.^C^ ((0, 1)) such that / <r = 0, and 

exp (ic^ej-fsH^) •; (s) <j (t) ds dt ^ 0. (17.9) 
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Note that such a <^ clearly exists. Indeed, take 77 e C(^ (0, 1) such that, 

r] {t) dt^O, / 77 (s) s'^ds^O, rj {t) t^ dt ^ 0. 

We let 77'''^ (t) = rrj (rt). Then, we have, 

exp (icejs'^tf) ryf'') (s) ry^'') (t) ds dt = 
i( I T]{s)s^ ds) ijri{t) t^ dt\ C(e,/)r"'="^ + O {r'"'^'^^) , as r ^ 00. 

Taking <; = t]'^^^ for r sufficiently large, we obtain (jl7.9p . 
For M large, and t >> M define, 

0<j<M 

k=-3f 

Here, j is an integer, but k need not be. Note that Kr,M is a product kerneP*^! uniformly in r and M. 
Furthermore, for M fixed and r >> M, Kr,M has small support. We will show. 



lim sup lim sup 1 1 Tk^ ^ 

Af-i-OO T— !-00 



(17.10) 



completing the proofP^I 

Let mo = ^e + -^J. To prove (|17.10p . it suffices to show. 



lim lim 

M— S-OO T— >c» 



exp {ip (s, t) T"^'>)Kr,M (s, t) ds dt 



since this is just the absolute value of the multiplier of Tk^ m evaluated at r™° . 
Note that. 



/ exp [ip (s, t) T'^°)KrM (s, t) ds dt 



0<j<M 

k=-3j 



0<j<M 
k=-3j 



^ / exp [ip (r-^s, r-h) r"°) ^^^') {s) ^(2') (i) ds dt 



(17.11) 



0<i5 



E /exp|z E C(,,„)(s,i)(^''')2^(''7-^)]^(s)^(i)dsdi. 



^''For the Kr,M to be product kernels, it is not necessary that 2^ra be of the form 2' for some I (and similarly for 

^-"^ Actually, the methods below imply limjv/— >c>o liminfr— >oo ^x^ j^i = 00, but this stronger statement is irrele- 

II ^' II L'^ — ^Ij2 
vant for what we wish to prove. 
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Let ( J2 1=0 ' ) denote the final equation in (|17.11l) . We will complete the proof by showing, 



lim 

M—t-oc 



M 

E 

J=0 



To do so, it suffices to show, 

^Ihn /"exp [ i Y. %.M i.s.ti'^''^ 2^>7-5) \^{s)^ (t) ds dt ^ 0, 



i9+Th=rno 



i.e., this limit exists and is nonzero. Note, when (g, h) = (e, /), we have hj — g = 0. Otherwise, if C(g /j) 
is nonzero and ^5 + ■j^ = 'tt-Oi we have hj — g < 0, hy the minimality of our choice for e. Thus, 



^hmyexpji Y. Hg,h){s,t)^''''^2^^''7-'^<;{s)<;{t)dsdt 



exp I 

by our choice of <7; this completes the proof. D 

Remark 17.9. If we had not restricted our attention to K with small support, we could have reversed 
the argument (taking, instead, r small) and shown that there exists a product kernel K such that Tk 
is unbounded if there is a C(ej) ^ with (e,/) lying above the line passing through (a',0) and (0,6'), 
where a' is the maximal a' such that cra',o) ¥" 0: ^-nd similarly for b' . 

Remark 17.10. Theorem 117.81 exemplifies the need to distinguish between pure powers and non-pure 
powers. In this case, the pure powers are those a G -^ (a, 0) , ( 0, 5) > such that c^ ^ 0, while the non-pure 
powers are remaining a such that c^ ^ 0. 

Remark 17.11. The reader familiar with |CWW06] will note that the necessary and sufficient conditions 
for the boundedness of Tk, where K [s,t) = —r are much more complicated than the conditions in 
Theorem 117.81 However, there are similarities between the results in |CWW06] and Theorem 117.81 
Indeed, the assumptions on p in Theorem ll7.8l can be restated as saying that the vertices of the Newton 
diagram of p lie on the coordinate axes. 

We now turn to contrasting this with the flag kernel setting. We take all the same notation as above, 
but set, 

^={((5i,<52)e[0,l]':,5i<(52}. 

We will be considering operators Tr- such that K ^ IC{2, e, oq, .4) for some small oq > 0. 
We take p and < a, 6 < oo as above. We assume that b < a. We have. 

Theorem 17.12. In the above setting, Tk is bounded on L^ for every K e /C(2,e,ao,^) (with aq > 
sufficiently small) if and only if for every Ca ^0, a lies on or above the line passing through (6,0) and 
(0,6). 

Proof. The if direction follows from Proposition [T7TJ Indeed, one needs only verify the straightforward 
fact that {dx, (0, 6)) controls {dx, a) if and only if a lies on or above the line passing through (0, 6) and 
(6, 0). Recall, the definition of control depends on the set A. 

The only if follows just as in the proof of Theorem ll7.81 replacing a with 6 throughout. It is easy to 
see, that when a is replaced by 6, the kernels Kt,m are uniformly flag kernels. D 

Remark 17.13. We will see, in |SSlla| . that the operators in Theorems ll7.8l and ll7. 121 which are bounded 
on LF' are also bounded on L'' (1 < g < cxd). 
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17.6 Flag kernels versus product kernels 

As mentioned earlier, if we are considering operators associated to product kernels, 

f^ij{x)Jf{jt{x))K{t) dt, 

then Theorem 17.21 holds for a larger class 7 when K is assumed to be a flag kernel (a special case of 
product kernels) . An example can be seen by contrasting Theorems 117.81 and 117.121 

In this section, we exhibit another, simpler example of 7 where our assumptions hold for flag kernels, 
but not product kernels. Unlike Theorems 117.81 and 117.121 we make no claim about the unboundedness 
of operators to which our theorem does not apply. 

We consider product (and flag) kernels on R^ supported near the origin. We take the standard 
two-parameter dilations on M^: 

(61,62) {s,t)^ (Sis, S2t). 

Thus, e = ((1,0), (0,1)). Let A = [(61,62) G [0,1]' : 61 < 62]. Let X(i.o) = d,, X(2,o) = er^dy, and 
X(o,i) = dy. Define, 

7(s,t) {x,y) = e^-^<i'«)+^'^<^'")+*^(".i) (x,y) . 

Then, for K € IC(2, e, a, A) (for a > sufficiently small) Theorem l7.2l (see also Proposition [TTTl]) applies 
to show that operators of the form 

f ^tp(x,y) / / (7(s,t) (x, y)) K (s, t) ds dt 

are bounded on L^, where "0 is a cut off function supported near G K.^. 

However, it is easy to see (again via Proposition [T7jT]) that the assumptions of Theorem 17.21 are not 

satisfied for the full class of product kernels: /C ( 2, e, a, [0, 1] 



Remark 17.14. The example in this section used functions that vanish to infinite order. However, we 
see by contrasting Theorems 117.81 and 117.121 that even when 7^ (x) is real analytic, it could be that 
operator corresponding to each flag kernel is bounded on L^, while there are operators corresponding 
to product kernels which are not bounded on L^. 

17.7 A multi-parameter difficulty 

One of the crucial notions that we used was when a list of vector fields with formal degrees (Xi, di) , . . . , (Xr, dr) 
"generated a finite list," see Definition 15.31 In the single parameter case (z^ = 1, ^ = [0, 1]), we saw in 
Lemma 13.21 that this notion is essentially equivalent to the set {Xi, . . . ,Xr} generating an involutive 
distribution which is locally finitely generated as a C°° module. The point of this section is to exhibit 
an example showing that the obvious multi-parameter generalization of the above does not hold. 

More specifically, let :/ = 2 and A — [0, 1] . Suppose we are given two lists of vector fields with 
two-parameter formal degrees, (Xi,di) , . . . , (Xr, dr), (Yi,d[) , . . . , (Yr',d'^,). We assume that di, . . . ,dr 
are all in the second component, while d[, . . . , d'^, are all in the first component. Suppose we wish 
to show that, 

(X,,di) ,..., (Xr,dr) , (Fl, d'l) , . . . , (Yr.,d'r,) (17.12) 

generates a finite list. In analogy with the single-parameter case, one might hope that the following 
(necessary) condition would be sufficient to prove that (J17.12I) generates a finite list: 

• The involutive distribution generated by {^i, . . . , Xr} is locally finitely generated as a C°° module. 

• The involutive distribution generated by {Yi, . . . , F,./} is locally finitely generated as a C°° module. 

• The involutive distribution generated by {Xi, . . . ,Xr,Yi, . . . ,Yri} is locally finitely generated as 
a C°° module. 
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This is not the case. 

We exhibit an example where the above hold, but which does not generate a finite list. Our vector 
fields are, 

(a.,(l,0)),(e-^9„(0,l)),(5„(0,2)). 

Let g (x) = e^^, and let g'™) denote the m-th derivative of g. If the above vector fields generated 
a finite list on a neighborhood of 0, then, in particular there would exist an m such that for every 
1 >'5>0, 

j<m 

where c^ and c are bounded uniformly in (5 > and a; > 0. To see this we have taken the special case 

(1,6) e [0, 1]^ (fT7l3l) is clearly impossible. 

Remark 17.15. Once again, we have used functions which vanish to infinite order at a point. This is 
essential, in the sense that the above phenomenon does not occur in the real analytic category. We will 
see this fact in [SSllbj. 

17.8 An example on the Heisenberg group 

In this section, we discuss in more detail the example given at the end of Section [161 The operators 
considered in this section are less singular than those in Section FIT. 2 1 We work on the three dimensional 
Heisenberg groupl^j H^. As a manifold M^ is diffeomorphic to M'^. We write {x,y,t) for coordinates of 
H^. The group law on H^ is given by (|16.8p and we define two parameter dilations on H^ by (116. 9|) . 
The Lie algebra of M.^ is three dimensional, and one may take as a basis, X = dx + '2ydt, Y = dy — 2xdt, 
T = dt. 

Given a bounded subset {<,j} .^^2 C C^ {^B^ (1)) satisfying. 



<^j (x, y, t) dx dt^O, I c,j (x, y, t) dy dt, Vj, 
we define a distribution by, 

K {x, y,t)= J2 22J"^+2J"^^, (2^'^x, 2^^y, V^+'H) - ^ ^f'^ (x, y, t) . 

Theorem 17.16. The operator given by, 

is bounded L^ {U^) -^ L^ (M^) . 

It is not hard to see that the operator in Theorem ll7.16l is essentially of the form covered by Theorem 
17.21 The differences are: we have not restricted to K with small support, and we have not localized the 
operator. However, using the dilation invariance of the class of operators in Theorem 1 17. 161 it is easy to 
see that Theorem 117. 161 follows from the result for K with small support and T localized. Alternatively, 
our entire proof goes through, essentially unchanged for the operators in Theorem 117.161 one does not 
need to use the small support of K or the localizing functions in this case. We leave the details to the 
reader. 

At this point we wish to discuss the proof of Theorem 17.21 in the context of Theorem ll7.16l Indeed, 
define the operator. 

It is not hard to see that T*Tj is essentially of the same form as Tj. Thus, at first glance, one might think 



that a T*T iteration type argument would not be useful in proving Theorem 117.161 On the contrary. 



^The methods discussed here generahze to any stratified nilpotent Lie group. 
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however, a T*T iteration argument was essential to our proof. The idea is that when j, k E 7/^ and 
ji > ki and J2 < ^2, the operator {T*Tk) T*Tk has a less singular Schwartz kernel than T*Tk. This is 
the essence of our proof. 

We close this section with one final remark. If the convolution in Theorem 117.161 is replaced by the 
Euclidean convolution on W^, then the operator may be unbounded on L^|ff| This further accents the 
need to distinguish between pure powers and non-pure powers. The point is, because of the dilations 
P6.9|) . X and y are pure powers, while i is a non-pure power. In the Heisenberg group case, the vector 
fields corresponding to the pure powers are {X, (1,0)), {Y, (0, 1)). The vector field corresponding to the 
non-pure power is (T, (1, 1)) = (| [-'^i i^] , (1,0) -I- (0, 1)) and therefore is controlled by the commutator 
of X and Y. In the Euclidean case, the vector fields corresponding to the pure powers are {dx, (1,0)) 
and {dy, (0, 1)); while the vector field corresponding to the non-pure power is (dt, (1, 1)) which is not 
controlled by the previous two. 

Finally, to see the operator might not be bounded on L^ if the usual Euclidean convolution is used, 
we need only show that there is a X as above whose Fourier transform is unbounded. To see this, let 
e C^ ((-1, 1)) be such that 0(0) = 1. Let ip € C^ ((-1, 1)) be such that J ip = 0, V' (1) > 0. Define, 



^ii,i2 (2;, y, t) 



(x) (y) V; (t) 




if ji = -h, 
otherwise. 



It is easy to see that these <rj satisfy the hypotheses of Thcorcm ll7.16l We have, 

i? (0,0,1)= Y. ^(i) = oo. 



h=-h 



completing the proof. 
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